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PERSONAL VALUES AS FACTORS IN ANTI-SEMITISM * 


RICHARD I. EVANS 


University of Houston 


so-called prejudiced individuals have been 

compared with the personalities of the so- 
called unprejudiced individuals. This compar- 
ison has resulted not only in a clearer picture 
of the personality structure of the preju- 
diced individual, but has also shown that 
prejudice toward minority groups tends to 
be only one of a constellation of personality 
factors that interact in such a manner that 
the individual possessing the constellation 
might well be described as an “authoritarian 
personality,” a phrase suggested by Adorno, 
et al. (1). 

Examination of the studies of the “authori- 
tarian personality” indicates that the role that 
values may play in such a personality pat- 
tern has not been systematically examined. 
Since the integration of personality is as 
Krech and Crutchfield (7) suggest, “mainly 
possible through the individual’s system of 
values,” ideals, and ideology,” the function of 
personal values would appear to be crucial. 

The present paper deals with an investiga- 
tion of one aspect of this possibility, i.e., that 
a relationship exists between personal values 
and one correlate of authoritarianism, namely, 
anti-Semitism. 

A series of hypotheses suggesting the 
nature of this relationship was deduced in this 
study. One basis for deducing these hypotheses 
resulted from examining a group of person- 
ality factors used by the California group in 
the Authoritarian Personality(1) in de- 
veloping the F scale which appeared to 
bear 2 logical relationship to Spranger’s val- 
ues (theoretical, economic, aesthetic, social, 


I’ the past several years the personalities of 


1 The present paper represents a portion of the dis- 
sertation presented in partial fulfillment of the require- 
ments for the degree of Doctor of Philosophy at 
Michigan State College, 1950. The writer wishes to 
express his thanks to Drs. Milton Rokeach (doctoral 
committee chairman), M. Ray Denny, Donald M. John- 
son, Harry Sundwall, and S. Howard Bartley for their 
invaluable advice and criticism. 

Portions of this paper were presented at the Texas 
Psychol. Ass., 1950, and at the Amer. Psychol. Ass., 
Chicago, 1951. 

2 The italics are the writer’s. 


political, religious). These personality factors 
are conventionalism, anti-intraceptiveness, 
extraceptiveness, stereotypy, preoccupation 
with power and toughness, and projectiv- 
ity. If it could be reasoned that certain of 
these factors as described by Adorno et al. (1) 
were consistent with the description of 
Spranger’s six values by Vernon and Allport 
(13), it was presumed that a positive rela- 
tionship should exist between the value and 
anti-Semitism. If from these descriptions it 
appeared that the personality factor and the 
value were inconsistent, it was presumed 
that a negative relationship should exist.* 

Another basis for constructing the hypoth- 
eses resulted from analyzing prejudice studies 
such as those of Allport and Kramer (2), 
Hartley (5), as well as Adorno et al. (1) with 
respect to references to values. It was found 
that references to values which parallel Sprang- 
er’s six values were made and that these ref- 
erences suggested that certain value differences 
are present in tolerant and intolerant indi- 
viduals.* 

As a result of this theorizing the following 
hypotheses were formulated: 

1. In view of the inferred relative intra- 
ceptiveness and the rejection of rigid con- 
ventionality and stereotypy by theoretical in- 
dividuals, the presence of strong theoretical 
values in the personality structure of individ- 
uals would be incongruent with anti-Semi- 
tism. 

2. In view of the inferred anti-intraceptive- 
ness, extraceptiveness, stereotypy, and projec- 
tivity of the highly economic individual, 
strong economic values would be congruent 
with anti-Semitism. 

3. In view of the inferred intraceptiveness 
and reactions against rigid conventionalism, 
extraceptiveness, and power by aesthetic in- 
dividuals, strong aesthetic values and anti- 
Semitism would be incongruent. 


8 For a detailed description of this procedure and the 
variables used, see Evans (3). 

*For a detailed description of the analysis, see 
Evans (3). 
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4. In view of the inferred intraceptive- 
ness and rejection of extraceptiveness, stereo- 
typy, and power by the social individual, 
strong social values and anti-Semitism would 
be incongruent. 

5. In view of the inferred anti-intracep- 
tiveness, extraceptiveness, preoccupation with 
power, and projectivity of the highly political 
individual, strong political values and anti- 
Semitism would be congruent. 

6. In view of the inferred intraceptiveness, 
and rejection of extraceptiveness and power 
by the religious individual, strong religious 
values would be incongruent with anti-Semi- 
tism. 

7. In view of the inferred relationship of 
the values to certain personality factors (in- 
traceptiveness, projectivity, etc.), individuals 
relatively high in anti-Semitism are domi- 
nated by economic and political values; indi- 
viduals relatively low in anti-Semitism by 
aesthetic, theoretical, social, and religious 
values. 

Another aspect of the problem is con- 
cerned with the role of values in the individ- 
ual’s justification of the absence or presence 
of anti-Semitic attitudes. For example, if in- 
dividuals are asked to justify the fact that 
they are or are not prejudiced against Jew- 
ish people by stating as many reasons as 
they can for being or not being anti-Semitic, 
will an analysis of these reasons reveal the 
definite influences of the individual’s domi- 
nant values in their composition? 

Still another aspect of the problem is sug- 
gested. To what extent is the number of rea- 
sons given for being or not being anti-Semitic 
an index of the actual intensity of the preju- 
diced attitudes? Would the prejudiced indi- 
vidual be able to give more reasons for and 
fewer reasons against being prejudiced than 
the unprejudiced individual? 

Since many recent studies® dealing with 
personality and perception have demon- 
strated the dynamic qualities of values as 
measured by the Allport-Vernon Study of 
Values, it could be expected that values might 
be reflected in the reasons individuals gave 
for being or for not being anti-Semitic. Fur- 
thermore, the operation of the mechanism of 
rationalization in the personality of the indi- 


5 For examples of these studies see McGinnies (9), 
McGinnies and Bowles (10), Postman ef al. (11). 
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vidual would suggest that an individual 
would have a number of reasons at his dis- 
posal to justify his attitudes. More specif- 
ically, with respect to an attitude toward 
Jewish people, it would be expected that, con- 
sistent with the operation of rationalization, 
an individual would have reasons supporting 
his attitude, regardless of whether or not it 
was favorable. On the basis of these deduc- 
tions, the following hypothesis was formu- 
lated: 

8. In view of the dynamic qualities of the 
values and the operation of the rationaliza- 
tion mechanism, dominant values will be 
reflected significantly in reasons given for 
being and not being anti-Semitic, and total 
number of reasons given will differentiate 
highs and lows in anti-Semitism. 


METHOD 


In order to test these hypotheses, the Allport- 
Vernon Study of Values (13), used to measure 
Spranger’s six values, the Levinson-Sanford Anti- 
Semitism scale (8), used to measure anti-Semitism, 
and a quasi-role-playing device designed to obtain 
the reasons subjects (Ss) could give for being and 
not being anti-Semitic were administered to a group 
of 169 white, non-Jewish students at Michigan 
State College. The quasi-role-playing device is a 
paper and pencil technique of role playing with the 
following instructions: “Some people are prejudiced 
against Jewish people; some people are not. For a 
moment imagine yourself as an individual who is 
distinctly prejudiced against Jews. State as many 
reasons for the prejudice as you can. Now 
imagine yourself as an individual who is distinctly 
not prejudiced against Jewish people. State as 
many reasons for not being prejudiced as you can.” 

The data were analyzed in the following manner: 

1. In order to test the hypothesis concerning the 
relationship between each of the six values and anti- 
Semitism, the Pearsonian product-moment correla- 
tion coefficient between each distribution of value 
scores and anti-Semitism scores was computed. 

As a further means of analyzing the nature of 
these relationships, the distribution was divided into 
four quarters on the basis of the anti-Semitism 
scores. The first quarter included 25 per cent of 
the sample who scored lowest on the anti-Semitism 
scale (low prejudice group); the second quarter 
included the next 25 per cent (medium low preju- 
dice group); the third quarter the next highest 
25 per cent (medium high prejudice group); the 
fourth quarter the highest scorers on the scale (high 
prejudice group). 

The mean value scores in each of these quarters 
were then computed, and a statistical comparison 
among the means, using Fisher's ¢ratio test (4), 
was made. 

2. In order to test the hypothesis concerning the 
value patterns present in high and low prejudice 
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groups, the scores made by each S on the six value 
subscales were transformed to rankings from 1 to 6. 
The value that the subject scored the highest on 
was ranked 1, the next highest 2, and so on. 

The mean rankings for the values in each of the 
four quarters were then determined. These mean 
rankings were in turn converted to rankings from 
1 to 6. In each quarter, the value with the highest 
mean rank was ranked 1, the next highest 2, and 
so on. An order of dominance pattern of the six 
values in each of the four quarters was thus 
computed, 

The justification for the use of this ranking proce- 
dure stems from the fact that the raw scores ob- 
tained on the six value subscales are in reality only 
relative scores, since the combined total of the raw 
scores on the six value subscales must always be 180. 

3. To test the hypothesis concerning reasons for 
prejudice and the values, an analysis in the high 
and low prejudice groups of all the reasons for 
being and not being prejudiced was made by the 
writer and two other judges. This analysis was 
designed to compare the number of reasons in the 
high and low groups which clearly reflected any 
of the six values, in terms of Spranger’s descriptions. 
If all three judges agreed that a reason was influ- 
enced by a particular value, it was tallied under 
the value in question. If any disagreement occurred, 
it was tallied under “miscellaneous reasons.” It 
was thus possible to determine the extent to which 
dominant values determined the content of reasons 
for prejudice in the high and low groups. 

The total number of reasons in the high and low 
prejudice groups was also tallied. The significance 
of the difference between the means of the anti- 
Jewish reasons in the high and low groups was then 
determined. This indicated the extent to which 
number of reasons is an index of the relative inten- 
sity of anti-Semitism. 


REsuULTs 


The first hypothesis states that theoretical 


values and anti-Semitism are incongruent. 
In terms of our scales, therefore, a negative 
relationship should exist between scores on 
the anti-Semitism scale and scoring on the 
theoretical value subscale. It can be seen from 
Table 1 that the correlation between theoret- 
ical values and anti-Semitism is —.069, in the 
hypothesized direction, but not statistically 
significant. On this basis, it appears that the 
hypothesis is not upheld. 

A study of the nature of this relationship 
by virtue of a statistical comparison among 
the four quarters of the anti-Semitism dis- 
tribution ® indicates that the only statistically 
significant difference (.o1 level of confidence) 


6 Table 2 shows the means, standard deviations, and 
standard errors of the means of value scores for anti- 
Semitism quarters 1, 2, 3, 4. 
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TABLE 1 


Means, STANDARD DEVIATIONS AND THE Propuct- 
MoMENT CorRELATIONS BETWEEN THE SIX 
VALUES AND ANTI-SEMITISM 


(N=169) 








SCALE M 





128.11 +7 _ 
27.63 -4 .069 
29.32 ° +311 
27.24 . -249 
30.57 . -173 
31.22 ; -322 
33-14 8 +136 





exists between Quarters 1 and 3. (Table 3)." 
This fact in itself does not lend sufficient sup- 
port to the hypothesis to uphold it. 

The second hypothesis states that economic 
values and anti-Semitism are congruent. In 
terms of our scales, therefore, a positive rela- 
tionship should exist between scores on the 
anti-Semitism scale and scores on the eco- 

7 Table 3 presents the statistical comparisons among 


the four anti-Semitism quarters with respect to mean 
value scores. 


TABLE 2 
MEANS, STANDARD DEVIATIONS, AND STANDARD ERRORS 
OF THE MEAN oF VALUE Scores FoR ANTI- 
SEMITISM QUARTERS I, 2, 3, AND 4 
(Low, Medium Low, Medium High, High Groups) 





N M SD 





SE/M 


g 


VALUE 





42 29.69 
42 25.51 
43 26.74 
42 28.55 

2 27.83 
42 27.60 
43 29. 
42 32. 
42 28. 
43 29. 
42 26. 
42 23. 
42 32. 
43 31. 
42 


1.14 
1.23 
-91 
-78 
-84 
-95 
-99 
1.04 
1.48 
1.08 
1.14 
1.01 
1.27 
1.06 
*77 
1.02 
1.03 
-99 
+73 
-80 


Theoretical 


Economic 


Aesthetic 


29. 
42 28. 
43 29. 
42 33- 
42 33- 
42 33- 
43 33- 
42 32. 
42 2. 


Political 


Religious 
1.71 
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nomic value subscale. The r between these 
scores (Table 1) is .311, which is significant 
at the .o1 level of confidence. This would ap- 
pear to uphold the hypothesis. 

A study of the nature of this relationship 
affords further insight into this finding. It 
will be observed from Table 3 that statistically 


TABLE 3 
STATISTICAL COMPARISON OF VALUE Scores IN ANTI- 
SEMITISM QUARTERS I, 2, 3, AND 4 
(Low, Medium Low, Medium High, High Groups) 








VALUE Qrr. Diff/M SE 





-38 


Theoretical 
.46 
33 


-34 
+30 
-4! 
+37 
-43 
-79 
. 87 
.48 


Economic 


Aesthetic 


-57 
-52 


a ne ee ee oe ee | 


Social 
.48 
-49 


-33 
25 


Political +30 


29 
+27 
-43 
-23 
.o8 
8o 
8o 
12 
15 
I! 
76 


ee ee eee ee ee 


Religious 


49 
+13 
-79 


+30 


4 
3 
4 
3 
4 
4 
3 
4 
3 
k4 
c4 
3 
4 
3 
‘4 
x 4 
3 
& 4 
3 
4 
c4 
3 
4 
c3 
4 
x 4 
‘3 
c4 
x 3 


“wenn 


& 
a 





significant differences exist between Quar- 
ters 1 and 4 (.o1 level), 2 and 4 (.o1 level), 
and 3 and 4 (.05 level). The results of the 
interquarter analysis, therefore, further sup- 
port the hypothesis. 

The third hypothesis states that aesthetic 
values and anti-Semitism are incongruent. In 
terms of our scales, therefore, a negative rela- 
tionship should exist between scores on the 
anti-Semitism and scores on the aesthetic 
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value subscale. The r between these scores 
(Table 1) is —.249, which is significant at the 
ot level of confidence. This would appear to 
uphold the hypothesis. 

Further insight into this finding can be 
obtained by study of the nature of this rela- 
tionship. It can be seen from Table 3 that 
statistically significant differences at the .o1 
confidence level exist between Quarters 1 and 
4, 2 and 4, and 3 and 4, which further sup- 
port the hypothesis. 

The fourth hypothesis states that social 
values and anti-Semitism are incongruent. 
In terms of our scales, therefore, a negative 
relationship should exist between scores on 
the anti-Semitism scale and scores on the 
social subscale. An r of —.173 (Table 1), sig- 
nificant at the .05 level of confidence, between 
social and anti-Semitism scale scores tends to 
support the hypothesis. 

The reader will also note (Table 3) that 
statistically significant differences at the .05 
level of confidence exist between Quarters 1 
and 4 and 2 and 4. 

The fifth hypothesis states that political 
values and anti-Semitism are congruent. In 
terms of our scales, therefore, a positive rela- 
tionship should exist between scores on the 
anti-Semitism and political value scales. A 
correlation coefficient of .33 (Table 1), sig- 
nificant at the .or level of confidence, was 
found between these two variables. This up- 
holds the hypothesis. 

It can be observed from Table 3 that sta- 
tistically significant differences at the .o1 con- 
fidence level exist between Quarters 1 and 4, 
1 and 3, 2 and 4, and 4 and 3 as well. 

The sixth hypothesis states that religious 
values and anti-Semitism are incongruent. In 
terms of our scales, therefore, a negative rela- 
tionship should exist between scores on the 
anti-Semitism scale and scores on the religious 
value subscale. It was found (Table 1) that a 
correlation of —.136, not statistically signif- 
icant, exists between religious values and anti- 
Semitism. This does not support our hypoth- 
esis satisfactorily, so the hypothesis must be 
rejected. Furthermore, even though this r 
is in the hypothesized direction, an analysis 
of the nature of this relationship indicates 
that interquarter differences do not approach 
statistical significance (Table 3). 

We may conclude this section of the re- 
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sults by adding that the anti-Semitism dis- 
tribution was also broken down according 
to sex, and an investigation of the relation- 
ship between each value and anti-Semitism 
in the male and female groups was made. 
All general trends presented above were sup- 
ported. 

The seventh hypothesis states that anti- 
Semitic individuals are dominated by political 
and economic values, while individuals rela- 
tively less anti-Semitic are dominated by 
aesthetic, theoretical, social, and religious val- 
ues. As was suggested in the methodology 
section of the present paper, a test of this 
hypothesis was effected by computing an or- 
der of dominance pattern of the six values 
in terms of rankings from 1 to 6 for the four 
quarters of the anti-Semitism distribution 


(Table 4). 


TABLE 4 


Means, STANDARD Errors, AND STANDARD Devt- 
ATIONS OF THE Six VaLue Scores CoNVERTED 
to RANKINGS IN ANTI-SEMITISM 
QUARTERS I, 2, 3, AND 4 
(Low, Medium Low, Medium High, High Groups) 








VALUES 


AE T Ss R E P 





Qrr. 





.00 -17 62 4.69 4.83 
<a ~ 408 P er: 
-35 1.36 .41 1.10 1.25 
02 9.50 4: -00 4.29 4.05 
+27 +26 . +30 +15 +24 
-7O 2.65 1. -.96 .97 1.51 
4 48% -38 3.40 2.74 
ae 0. 2. 
.58 1.48 1. -93 1.62 1.45 
-17 4-69 3. -57 2.36 2.31 
-19 .09 ‘ .28 -20 +21 
+23 58 1. -76 1.30 13.37 





When the mean rankings in the low preju- 
dice group (first quarter) are converted to 
rankings from 1 to 6 they appear in the fol- 
lowing order from highest to lowest: aesthetic, 
social, religious, theoretical, economic, and 
political. 

In the medium-low prejudice group (the 
second quarter) the mean rankings appear 
in the following order: religious, aesthetic, 
social, theoretical, political, and economic. 

In the medium-high prejudice group (the 
third quarter) the mean rankings appear in 
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the following order: political, religious, eco- 
nomic, social, aesthetic, and theoretical. 

In the high prejudice group (the fourth 
quarter) the mean rankings appear in the 
following order: political, economic, religious, 
social, theoretical, and aesthetic. It will be 
noted that with the exception of the medium- 
high prejudice group (the third quarter), 
which appears to be dominated by political 
and religious values rather than by political 
and economic values as hypothesized, the pre- 
dicted value patterns appear to exist. 

The eighth hypothesis states that the dom- 
inant values in the high and low anti-Semi- 
tism groups will function as significant de- 
terminers of the types of reasons given for 
being and not being prejudiced, and the num- 
ber of reasons given will be an index of the 
intensity of anti-Semitism. 

As has been established in the hypothesis 
discussed above, the dominant values in the 
high group are political and economic, while 
the dominant in the low group are aesthetic, 
theoretical, social, and religious. Conse- 
quently, in order to support the first part of 
the hypothesis, the number of economic and 
political reasons given by the high group 
should be significantly greater than the num- 
ber given by the low group, while the num- 
ber of aesthetic, theoretical, social, and re- 
ligious reasons given by the low group should 
be significantly greater than the number 
given by the high group. 

Table 5 presents the number of value-influ- 
enced reasons and other reasons given by 
high and low anti-Semitism groups. It will 
be noted that a trend exists indicating the 
operation of economic and political values to 
a greater extent in the high anti-Semitism 
group than in the low, both in anti- and pro- 
Jewish reasons, lending some support to our 
hypothesis. A slight trend exists indicating 
the greater influence of theoretical and re- 
ligious values in the pro-Jewish reasons given 
by the low group. However, neither of these 
trends is statistically significant. Thus, while 
these results are highly suggestive, the first 
part of the hypothesis has not been clearly 
upheld. 

In order to uphold the second part of the 
hypothesis, the number of reasons given for 
being anti-Semitic should be significantly 
greater in the high prejudice group than in 
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the low prejudice group, while the number of 
reasons given for not being anti-Semitic 
should be significantly greater in the low 
prejudice group. 

Table 6 presents a statistical comparison of 
the reasons for prejudice given by the high 
(the fourth quarter) and low (the first 
quarter) groups. It will be noted that the 
high group gave an average of 3.88 reasons 
for not being prejudiced, while the low group 
gave 2.67. The difference of 1.21 is significant 
at the .o1 level of confidence. 
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Discussion oF REsuLTs 


In the results it was stated that theoretical 
values did not appear to be significantly re- 
lated to anti-Semitism in the hypothesized 
negative direction. Why wasn’t the hy- 
pothesis upheld? A possible answer to this 
question is found in our theoretical consid- 
eration. We inferred that the theoretical 
individual would tend to be intraceptive. 
However, some theoretical individuals, be- 
cause of being strongly empirical in their 
approach to problems, might tend to be 


TABLE 5 


NuMBER OF VALUE-INFLUENCED AND MISCELLANEOUS Reasons IN HiGH anv Low 


ANTI-SEMITISM GROUPS 








TYPE oF 





VALUE 





REASON 


AE 





Anti-Jewish 


Pro-Jewish Low 





It will also be noted that the high group 
gave an average of 2.88 reasons for not being 
prejudiced, while the low group gave 3.57. 
The difference of .6g is significant at the .05 
level of confidence. 

These data appear to uphold the second 
part of our hypothesis. It may be concluded 
that the number of reasons given by groups 
for being and not being anti-Semitic serves 
as a reliable index of whether the groups 
are high or low in anti-Semitism. 


TABLE 6 
A Sratisticat CoMPARISON OF THE NUMBER OF 
PreyUDICED AND Non-Preyupicep REASONS 
Given sy THose In THE HicH anp Low 
Groups ON THE ANTI-SEMITISM SCALE 


SE 
M SD SE/M Diff/M Diff/M ¢ 





Preyupicep REAsONs 





Non-PreyupiceD REASONS 


somewhat extraceptive. This raises the possi- 
bility that both intraceptive and extraceptive 
theoretical individuals might score high on 
the theoretical value scale. Since intraception 
is related to tolerance and extraception to 
intolerance, if even a relatively few extra- 
ceptive theoretical individuals scored high on 
the scale, it could materially reduce the sig- 
nificance of any negative relationship between 
theoretical values and anti-Semitism that 
might be present. 

The possibility of the presence of both in- 
traceptive and extraceptive theoretical indi- 
viduals provides a basis for further research. 
It is possible that the difference may be sim- 
ilar to the difference that may exist between 
social scientists and physical scientists. A 
comparison of relative strength of prejudice 
between these two groups may in itself have 
some bearing on this problem. From per- 
sonal experience, the writer has observed that 
the social scientists are relatively less intol- 
erant than the physical scientists. Of course, 
a critical examination of this possibility 
would require the utilization of a scale that 
differentiated between intraceptive and ex- 
traceptive individuals, and administering it to 
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high scorers on the Allport-Vernon theoret- 
ical scaie. 

The hypothesized negative trend in the 
relationship between religious values and anti- 
Semitism was also found to be insignificant. 
In addition, the findings relative to the posi- 
tion of religious values in the value patterns 
of the four anti-Semitism quarters appear to 
be rather paradoxical. Religious values ap- 
pear to play an equally important part in 
both extreme quarter groups. The reader will 
recall that in the medium low group, reli- 
gious values rank 1 and in the medium high 
group they rank 2, while in both the high 
and low anti-Semitism groups religious val- 
ues rank 3. What explains these appar- 
ently paradoxical findings? 

A possible explanation is to be found in 
a consideration of the type of religiousness 
that is measured by the religious value scale. 
Kirkpatrick (6) has noted that humanitarian- 
ism and religion in the narrow sense that it 
is present in many individuals are negatively 
correlated. However, Clark, in Values in Per- 
sonality Research (14), and Rosenblith (12) 
suggest the possibility of the presence of a 
positive or humanitarian religion as well as 
a narrow religion that tends to breed in- 
creased ethnocentrism. 

It is possible that the religious scale does 
not reliably differentiate between individuals 
who are religious in the humanitarian sense 
and those religious in the narrow sense. If 
this is a valid assumption, it would be ex- 
pected that the low prejudice groups would 
score high in religious values, because of the 
presence of a strong humanitarian religious- 
ness, while the high prejudice groups would 
score high in religious values because of the 
presence of a strong, narrow type of reli- 
giousness. This would explain the apparent 
ambiguity in the results. 

The absence of a statistically significant 
trend indicating that the dominant values 
determined the kinds of reasons given by the 
high and low anti-Semitism groups, can be 
explained on the basis of an analysis of the 
reasons that were given. 

It was found that the majority of the Ss, 
both prejudiced and unprejudiced, were in- 
fluenced by existing stereotypes concerning 
the Jews in constructing their reasons. Most 
of the prejudiced reasons merely indicated 
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an acceptance of a prevalent negative stereo- 
type, while most of the unprejudiced reasons 
merely discounted prevalent negative stereo- 
types. In other words, most reasons had to 
do with an acceptance or rejection of the 
“clannishness,” religious difference, “differ- 
ent” appearance, “shrewdness,” “shady” finan- 
cial activities, or similar negative stereotypes 
of the Jews. It appears that stereotypes, 
therefore, were more important as determin- 
ers of reasons given by both groups than 
were the values as such. 

Incidentally, with respect to the qualitative 
data, the following is a typical response. 
(This respondent is male, unmarried, high 
scorer—upper 25 per cent—on the A-S Scale.) 
Reasons for being prejudiced against Jewish 
people: “Usually have a foreign look, always 
penny-pinching, tend to be showy, strike a 
hard bargain, talk with hands in fast lan- 
guage, have large families.” 

This same S gave the following reasons for 
not being prejudiced against Jewish people: 
“Same color as I am, intelligent in most mat- 
ters, their religion is their business not mine, 
I’m in love with a Jewish girl.” 

These data demonstrate the existence of a 
provocative basis for a qualitative analysis 
of the responses. This analysis is being com- 
pleted and will be reported at a later date. 


SUMMARY 


The present study was concerned with the 
problem of personal values as factors in anti- 
Semitism. In order to measure personal val- 
ues, the Allport-Vernon Study of Values was 
used. Anti-Semitism was measured by the 
Levinson-Sanford Anti-Semitism Scale, 

Hypotheses were formulated stating that 
theoretical, aesthetic, social, and religious val- 
ues are negatively related to anti-Semitism, 
while economic and political values are posi- 
tively related to anti-Semitism. Another 
hypothesis stated that in an order of domi- 
nance pattern consisting of all six values, high 
scorers on the anti-Semitism scale would be 
more dominated by economic and political 
values, while low scorers would be more 
dominated by theoretical, aesthetic, social, and 
religious values. A final hypothesis stated 
that individuals significantly reflect their 
dominant values in the reasons they give for 
being and not being anti-Semitic, and that 
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the number of reasons given serves as a 
reliable index of the position on the anti- 
Semitism scale. 

In order to test these hypotheses, the Study 
of Values, the anti-Semitism scale, and a 
quasi-role-playing device designed to obtain 
the reasons Ss could give for being and not 
being anti-Semitic were administered to a 
group of 169 Michigan State College students. 
Anti-Semitism and each value were corre- 
lated. The anti-Semitism distribution was di- 
vided into four quarters, which were statis- 
tically compared with respect to each value. 
The scores in each quarter were converted 
to rankings from 1 to 6, and the upper and 
lower quarters were statistically compared 
with respect to the number of value-influ- 
enced reasons for prejudice given and the 
total number of reasons given. Finally, cer- 
tain group differences on the anti-Semitism 
and value scales were statistically compared. 

The following conclusions were reached on 
the basis of the tests of the hypotheses: 

1. Significant negative relationships exist 
between aesthetic and social values and anti- 
Semitism. (Hypothesis confirmed.) 

2. Significant positive relationships exist 
between political and economic values and 
anti-Semitism. (Hypothesis confirmed.) 

3. No significant relationships between 
theoretical and religious values and anti- 
Semitism exist, although a slight trend in- 
dicating a negative relationship was found. 
(Hypothesis not confirmed.) 

4. The value patterns, in the order of 
dominance, for the four anti-Semitism quar- 
ters are as follows: highs—political, economic, 
religious, social, theoretical, aesthetic; me- 
dium highs—political, religious, economic, 
social, aesthetic, theoretical; medium lows— 
religious, aesthetic, social, theoretical, po- 
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litical, economic; lows—aesthetic, social, re- 
ligious, theoretical, economic, political. (Hy- 
pothesis essentially confirmed.) 

5. Dominant value patterns were not sig- 
nificantly reflected in reasons given for being 
and not being anti-Semitic, but number of 
reasons given proved to be a reliable index 
of the position on the anti-Semitism scale, 
with highs giving significantly more preju- 
diced and fewer unprejudiced reasons than 


lows. (Hypothesis partly confirmed.) 
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PERSONAL VALUES AND ESP SCORES * 


GERTRUDE RAFFEL SCHMEIDLER 
City College of New York 


attitude toward the experiment is an 
important determinant of the results. 
Subjects (Ss) with an unfavorable set will 
perceive material differently, or remember it 
differently, or respond differently to it, from 
those with a favorable set. This factor is 
taken into account when an experiment has 
been made so painful, uncomfortable, or 
embarrassing that it can be assumed that all 
Ss are under stress, or when an experimenter 
(E) reports that rapport was good for all Ss. 
But there are other experiments, of which the 
following is an example, where it must be 
assumed that motivational patterns vary be- 
cause different Ss respond to different aspects 
of the stimulus situation. In such a case, 
knowing S’s values can aid in predicting 
which aspects of the situation will be most 
effective in determining his responses. 
During an extended series of experiments 


Il some research~projects, the subject’s 


on ESP we have inquired into one factor 
which might be expected to influence S’s atti- 
tude: whether he accepts the possibility of 
paranormal success under the conditions of 
the experiment, or whether he is convinced 


that such success is impossible. (For con- 
venience, in order to avoid repeating this 
cumbersome differentiation, Ss who accept 
the possibility of paranormal success under 
the experimental conditions have been called 
“sheep” and Ss who reject it have been called 
“goats.”) Our over-all finding has been that 
the sheep tend to have higher average ESP 
scores than the goats (4). But it has become 
more and more evident that Ss’ answers to 
this intellectual question do not differentiate 
them neatly into groups with favorable or 
unfavorable attitudes toward the experiment 
as a whole. Some of the sheep are bored or 
irritated; some of the goats, even though they 
think any success they have is due to chance, 
will say that they like guessing games and 

1 ESP stands for extrasensory perception, a term used 
to denote telepathy, clairvoyance, or some similar process. 

This research was financed by the Richard Hodgson 


Fund of Harvard University, through the kindness of 
Dr. Gardner Murphy, who was Hodgson Fellow. 
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enjoy the session. The author has therefore 
suggested elsewhere (6) that the sheep-goat 
dichotomy will be most meaningful for Ss 
to whom theoretical problems are important 
(specifically, Ss who score high on theoretical 
values on the Allport-Vernon Study of - 
Values [1]), and consequently that the sheep 
to whom theoretical values are important 
should have distinctly higher average ESP 
scores than the goats to whom theoretical 
values are important. 

The following paper reports data that were 
collected to test this hypothesis. They were 
collected after the hypothesis had been stated. 


METHOD 


The Ss of the experiment were the author’s 
students in introductory psychology during the 
academic year 1950-1951. The experiments were 
conducted during a regular class period. Four 
classes were tested. 

The stimulus material consisted of lists of 25 
items. The items were chosen from among five 
possibilities. In one set of lists, each of the five 
items represented a color: red, green, yellow, blue, 
or brown. In another set of lists the items repre- 
sented the conventional five ESP symbols: cross, 
square, circle, star, or a set of three wavy lines. 
The order of the items was determined from a 
table of random numbers. The Ss’ task was to 
guess the order of the items in concealed lists. (For 
a more detailed description of the method, see [5].) 
With lists prepared in this way, the average number 
guessed correctly, if choice alone determined the 
guesses, would approximate five, with a standard 
deviation of two. 

A permanent record of the targets was prepared 
before the experiment, and was put into an opaque 
envelope, where it remained until Ss completed 
their responses. The responses were written on 
mimeographed record blanks. These blanks were 
collected before Ss were permitted to see the stimu- 
lus list. The E did not know what the target was 
at the time of giving instructions, and thus could 
not have inadvertently guided the correct choice. 
All responses were given two independent scorings. 
All Ss were supposed to guess at eight lists, but 
some latecomers missed the beginning of the session. 

Under these conditions, correct choice of the 
target (or systematic avoidance of the target) could 
not have been due to sensory cues, since the lists 
were concealed from both S.and E. It could not 
have been due to Ss’ inferences about the stimulus 
order, since that order was random. It is highly 
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unlikely that it was due to idiosyncrasies of Ss’ 
response patterns which happened to coincide with 
the stimulus order, since 32 separate target lists 
were employed. It is not likely that it was due 
to scoring errors, since the data were carefully 
checked twice. It could not be assigned to statisti- 
cal artifact, for two reasons. One is that the 
method of determining~chance expectancy has been 
approved by competent statisticiafis. The second is 
that one group of our Ss scored above mean chance 
expectancy while the other group scored below; and 
our conclusions rest upon group differences, not 
upon absolute scores. Statistically significant dif- 
ferences in our data may therefore be attributed to 
that capacity which we call ESP. 

After their responses were completed, but before 
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The administration and scoring of the Allport- 
Vernon Study of Values followed the standard 
instructions printed on the forms (1). 


REsutts 

The data are summarized in Table 1. 
There was a total of 959 ESP runs, each of 
which, it will be remembered, consisted of 
25 separate responses. These runs yield a 
total of 23,975 responses. By mean chance 
expectation, about one-fifth or a total of 
approximately 4,795 of these responses should 
have matched the target. The obtained total 
was 4,894, giving a deviation from chance 


TABLE 1 


Retation Between ESP Scores ano THeoreticaL VaLues For Susyects WHo Accept THE PossiBILity 
OF PARANORMAL SUCCESS UNDER THE CONDITIONS OF THE EXPERIMENT (SHEEP) AND 
ror Susysets Wuo Reject THis Posstsitity (Goats) 








NUMBER DEVIATION 
OF 


Runs 


Group 


FROM CHANCE 
EXPECTANCY 


DIFFERENCE BETWEEN SHEEP 


MEAN F 
AND GoaTs 


ESP 
Score 





MEAN t 





All Subjects 
Sheep 
Goats 


+151 
Ss below goth percentile 
Sheep 
Goats 
Ss at or above goth percentile 
Sheep 
Goats 


+69 
—I19 


+82 
—33 
Ss at or above gsth percentile 
Sheep 
Goats 


+38 
—_B84 
Ss at rooth percentile 
Sheep 
Goats 


+37 
—4 








refers to Theoretical Value 
they knew which responses were correct, all Ss were 
required to classify themselves in respect to their 
intellectual attitude toward the experiment. The 
specific question put to them was, “Do you think 
that, under these conditions, correct responses might 
be due to clairvoyance or telepathy? Or do you 
think that, under these conditions, correct guesses 
must be due to chance alone?” Although many 
shadings of opinion are possible here, all responses 
were divided into the two categories above. In 
answering the question, Ss were instructed that if 
there was any shadow of doubt in their minds,. or 
if they were open to conviction, they should classify 
themselves as believing that correct responses might 
be due to paranormal factors (i.e, as sheep), and 
that they should classify themselves as rejecting the 
paranormal success under these con- 
ditions (i.e., as goats) only if they were completely 
convinced—no matter how the results came out— 
that successes were due only to chance. 


pe sssibility of 


scores 


This figure is not significantly 
different from chance expectation. 

When the total group is divided into sheep 
and goats, we find that the sheep scored sig- 
nificantly better than the goats (pP,iy;—=.003).? 

When the groups are subdivided according 
to their scores on the Study of Values, it is 


of +119. 


2 It may be remarked here that the difference between 
sheep and goats is greater than was expected. In six 
other academic years, during which similar group ex- 
periments were performed with more than one thousand 
Ss, the mean difference between sheep and goats has 
varied from —.08 to +.34; and the over-all mean differ- 
ence was only .15 in favor of the sheep. Because of the 
large number of runs, however, this difference is signifi- 
cant at the .o005 level. In individual tests, the +.39 
mean difference between 1100 runs made by sheep and 
943 runs made by goats was significant at the .oooo1 
level. 
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apparent at a glance that the difference be- 
tween the sheep and the goats is accentuated 
for Ss with high theoretical scores (Table 1 
and Fig. 1). Since it may be of interest to 
know what proportion of Ss had scores above 
or below the level of mean chance expectancy, 
the data in Table 2 are given in terms of num- 
ber of Ss. None of our computations is 
based on this analysis. 

For the 96 Ss who fell below the goth per- 
centile in theoretical values, the sheep had 


i i = 
re) Ne o 


MEAN ESP SCORE 
i 
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high on the theoretical values scale of the 
Allport-Vernon Study of Values. 


Discussion 


When examination of the data shows that 
comparatively high ESP scores were made 
by sheep to whom theoretical values were 
important, the finding requires little com- 
ment, other than that it is consistent with the 
prediction made before the experiment was 
begun. In any such task, Ss who can iden- 

















Below 90th 
Percentile 


At or above 
$Oth Percentile 


At 100th 
Percentile 


At or above 
95th Percentile 


SCORES ON THEORETICAL VALUES OF ALLPORT-VERNON STUDY OF VALUES 


Fic. 1. 


Mean ESP Scores or Susyects WHo Acceprep THE Possipitity OF PARANORMAL SUCCESS UNDER 


THE CONDITIONS OF THE EXPERIMENT (SHEEP) CoMPARED WITH MEAN ESP Scores oF Susyects 
Wuo Reyecrep THIs Possipitiry (Goats), CLassiriep ACCORDING To SCORES FOR 
THEORETICAL VALUES ON THE ALLPORT-VERNON StuDY OF VALUES 


ESP scores that were only suggestively higher 
than those of the goats (paizy—=.06); while for 
the 26 Ss whose theoretical values fell in the 
goth percentile or above, the difference be- 
tween ESP scores for sheep and goats was 
highly significant (Paiyy—=.002). There were 
so few Ss in the 95th percentile or above that 
it is not profitable to consider their scores at 
length; but the implication of the sparse data 
available is that the same trend continues, and 
that the sheep-goat difference is still more 
pronounced for the Ss who were extremely 


tify with the purpose of the research, would 
be expected to turn out a better perform- 
ance than the others. The scores of the 
goats, however, may seem more perplexing. 
Although it is not statistically significant, 
their data show a tendency to be below mean 
chance expectancy, and we can give this 
tendency more weight because it is consistent 
with the findings of previous research (4, 6). 
Would not one expect them to hit at about 
the “chance” expectancy, rather than below it? 

Probably the key to their low average scores 
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lies in the fact that some of the goats, espe- 
cially those with high theoretical values, had 
such a firm and uncompromising negative 
attitude toward=«the experiment. They 
thought it was absurd. They disapproved of 
anyone’s wasting her time—and theirs—on 
this nonsense. For such Ss we can infer 
that they did not want their guesses to be 
correct, and that, at some unconscious level, 
they were trying to avoid the success that they 
said could not occur. Our results are reminis- 
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in ESP research, scores may be expected to 
fall below chance expectation. 

In interpreting these data, it is necessary to 
keep constantly in mind that our generaliza- 
tions do not apply to all the individuals 
within the group. The data of Table 2 indi- 
cate marked individual variation. This 
means that the attitude we have been study- 
ing is only one of the factors that determine 
ESP scores, and probably that, for most Ss, 
it is not the most important factor. One fur- 


TABLE 2 


NumBer oF Susyects with ESP Averaces Anove, aT oR BeLow Mean CHANCE EXPECTANCY 








AVERAGES ABOVE 
MEAN CHANCE 
EXPECTANCY 


Group 


AVERAGES BELOW 
MEAN CHANCE 
EXPECTANCY 


AVERAGES AT 
MEAN CHANCE 
EXPECTANCY 


Tora 
NUMBER 





All Subjects 
Sheep 
Goats 

Ss below goth percentile 
Sheep 
Goats 

Ss at or above goth percentile 
Sheep 
Goats 

Ss at or above 95th percentile 
Sheep 
Goats 

Ss at rooth percentile 
Sheep 
Goats 


64 
58 


49 
47 


15 
II 








rank refers to Theore 


Note: Percentile 
cent of those reported by Hull, in an experi- 
ment where Ss were told that they would 
sway forward (3). Many did so, but some 
negativistic individuals showed a pronounced 
sway backward. They, were influenced by the 
stimulus but they did not want to give the 
appropriate response to it and, like our goats 
with high theoretical values, their responses 
showed a negative deviation. Similar results 
were reported by Bruner and Postman in 
research on the perception of incongruity (2). 
The Ss who were shown playing cards printed 
so that spades were red and hearts were black, 
would often report that they had seen black 
spades and red hearts. These Ss, even more 
than ours, gave consistently incorrect re- 
sponses rather than “chance” responses. 
Bruner and Postman interpret this as “resist- 
ance to the unexpected or incongruous.” 
Where such resistance is known to be present 


tical Value scores. 


ther reservation should be made explicit: that 
the findings can apply only to groups tested 
under the conditions of this experiment. It 
was made clear to my classes that the purpose 
of the ESP tests was to study ESP, a problem 
which would be expected to be of most con- 
cern to Ss with high theoretical values. Pre- 
sumably in a different interpersonal situation, 
different patterns would have appeared. If, 
for example, we had promised substantial 
money rewards for high scores, we might 
have found ESP success more closely related 
to economic than to theoretical values. 
Further examination of our findings can 
place primary emphasis on either the Study 
of Values or on ESP. For the former, they 
not only support the truism that the more you 
knew about your Ss, the better you can 
understand the individual differences that 
appear in your research, but they go one step 
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further: they imply that if you know what 
an S’s values are, you can predict which 
aspects of an experiment will motivate him. 
To some S’s, understanding and approving 
the purpose of an experiment will make for 
better performance; to others it is almost 
irrelevant. This can be of crucial importance 
in research dealing with such topics as the 
influence of reward on performance, since 
the praise or money which is an important 
reward for one S may be of little concern to 
the next. 

What light does the experiment throw on 
ESP? In one way, very little: it does not tell 
us what the basic nature of the process is, or 
what its physical or physiological correlates 
are. But from the psychological point of 
view it indicates, as does so much other 
research, that the ESP function seems to fol- 
low the same laws as other psychological 
functions; that the conditions which would, 
for example, be expected to facilitate work 
efficiency are conditions which facilitate ESP 
success. Although in these experiments ESP 
seems to be feeble, with small percentage dif- 
ferences between groups, the general trend 
of the data is psychologically meaningful. 
This implies that ESP could profitably be 
studied as other psychological functions. are, 
rather than be relegated to some limbo out- 
side the bounds of scientific knowledge. 
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SUMMARY 


Unselected ESP Ss were asked whether 
they believed paranormal success under the 
conditions of the experiment was possible, 
and they were given the Allport-Vernon 
Study of Values. The predictions were made 
(a) that Ss who accepted the possibility of 
success would have higher ESP scores than 
those who rejected this possibility, and (4) 
that Ss with high theoretical scores on the 
Study of Values would show this difference 
more markedly than the other Ss. The data 
support both predictions. 
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HE experimental study of the products 

of small face-to-face groups from a psy- 

chological point of view remains rela- 
tively unexplored. The focus of the psychol- 
ogist has been primarily on the performance 
of the individual in the group, rather than on 
the psychological products of the group con- 
sidered as a group. 

The early experimentation by Dashiell (2), 
Travis (8), and others involved a comparison 
between the measured performances of the 
individual person when under influences 
from other persons physically present, with 
the measured performances in identical func- 
tions of the same individual when working 
alone. These researches on coacting groups 
did make some real contributions to an un- 
derstanding of a few of the conditions under 
which judgment, memory, and learning may 
take place. They reflected, however, the out- 
look on group phenomena that Allport ex- 


pressed in the statement, “There is no psy- 
chology of groups which is not essentially and 
entirely a psychology of individuals” (1). 
There are good logical and methodological 
reasons for examining Allport’s statement 
more critically. While it does represent one 
point of view still more or less prevalent in 


contemporary psychology, it is equally felt 
that Lewin has affirmed the tenability of an- 
other more fruitful position. He recom- 
mended a theoretical but emphasized an em- 
pirical justification of group experimentation. 
For the purpose of the present study, the fol- 
lowing quotation from Lewin will suffice 


to make this point clear. 

If recognition of the existence of an entity depends 
upon this entity’s showing properties or constancies 
of its own, the judgment about what is real should 
be affected by changes in the possibility of demon- 
strating properties. The taboo against 
believing in the existence of a social entity is prob- 
ably most effectively broken by handling this entity 
> <n 


social 


experimentally 


1 This 
Psychologie 
at the Sorbonne 
Fraisse. 


research was conducted at the Laboratoire de 
mentale de L’Ecole des Hautes Etudes 
under the sponsorship of Professor P. 


Exper 
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The experiments of Lewin, Lippitt, and 
White (5) and French (3) represented a great 
advance in the confidence in the reality of 
social phenomena. Their researches con- 
tribute primarily to the understanding of the 
effects of variation of such group variables 
as “atmosphere” and frustration on the be- 
havior of the individuals in the group. 

There has been relatively little research, 
however, on the properties of group-qua- 
group basic processes and especially the condi- 
tions and laws of change of group products 
such as group-perceived products, group- 
learned products, and group-memory prod- 
ucts. One difficulty may have been the lack 
of a precise method for evoking a group prod- 
uct. The method of agreement described in 
the procedure below is an attempt to provide 
a technique for evoking measurable group 
product behaviors, such as group-perceived 
products, group-learned products, etc. The 
method insures interaction among the mem- 
bers of the group but only one product to rep- 
resent the group. 

The previous studies that bear on the study 
of group basic processes are few. Watson 
(9) compared group “thinking,” that is, the 
ability to make words out of letters contained 
in a given word, and found the superiority 
of group products to the average and the 
best individual in the group. His groups 
were groups in which the members had a 
common goal but, because of the nature of 
the instructions and the task, interaction 
among the members was greatly limited. 

Shaw (7) conducted a study comparing in- 
dividuals and small groups in the rational 
solution of compex problems. She found 
also the superiority of groups in the ability 
to solve the problems given. Group activity 
seemed to insure that the errors made were 
usually at later stages in the solution of the 
problem, and that group superiority is in 
part due to the rejection of incorrect ideas. 

These two studies do attempt to examine 
the properties of group processes and prod- 
ucts. But among a number of shortcomings 





Group LEARNING OF 


primarily due to the lack of conceptualiza- 
tion of group phenomena, there are notice- 
able gaps in the explanation of the condi- 
tions under which a group “thinks” better or 
worse than an individual, or one group 
“thinks” better or worse than another. 

A research, using the method of agreement, 
was conducted by Perlmutter (6) on group- 
qua-group memory products. In this study 
the properties of group-qua-group memory 
products were considered, as well as some 
analyses of group versus individual memory 
and the conditions under which each may 
change. 

The present study is a second, now aimed 
at investigating under experimental condi- 
tions the properties of group-qua-group 
learned products, by repeating learning ex- 
periments long associated with individual psy- 
chology. 

This study aimed at establishing for the 
specific task and population a “learning” 
curve of group-learned products. The task 
of learning a list of nonsense syllables was 
given to individuals working separately and 
as a group. The group task required dis- 
cussion, interstimulation, and a single prod- 
uct agreed upon by the group as represent- 
ative of it. It called for an interchange of 
ideas, acceptance, rejection, and interaction 
that proved to be fairly rich, owing to the 
ambiguity of the nonsense syllables. 

The study examined the learning rate of 
the group as compared with the individuals, 
the time required by groups and individuals, 
and the errors of the group versus the errors 
of the individual in an attempt to make pre- 
cise under what conditions and along what 
dimensions groups differ from individuals on 
“learning.” 


METHOD 
Subjects 


The Ss in this experiment were students at the 
Sorbonne, in most cases first-year students of psy- 
chology. Although most participants were French, 
Ss included Italians, Egyptians, Moroccans, Belgians, 
and Swiss. All spoke French fluently. There were 
very rare communication difficulties due to language. 
The experiments were conducted during two hours. 
During one part, the group was given a list of 19 
two-syllable “nonsense” words to learn as a group; 
the other part, after a 15-minute rest, required 
that the individuals work separately but in the 
presence of one another on an equivalent but dif- 
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ferent list. The order of procedure and lists were 
varied systematically. Twenty groups of three per- 
sons, homogeneous according to sex, were used (6 
groups of males, 14 groups of females). No sex 
differences in performance were noticed. 


Materials 


Two equivalent lists of 19 two-syllable nonsense 
words were used. Their equivalence is demon- 
strated statistically. There are no differences be- 
tween groups or individuals using the different lists. 

Nonsense syllables were used because the lack of 
logical associations with each word stimulated dis- 
cussion. This also made possible the repetition of 
the experiments with American groups. 


Working as a Group 


The following instructions were used to evoke a 
group product during the part of the experiment 
in which the individuals worked as a group (transla- 
tion from the French): 


The purpose of this experiment is to find how you 
are going to learn a list of 19 two-syllable nonsense 
words, working as a group. Your individual results 
are not of interest; the product of your collaboration, 
as close as possible, is of primary importance. It is 
necessary to make the maximum effort to retain all the 
words as quickly as possible; the order in which you 
recall them is of no importance. 

This is how we are going to proceed: I will present 
the list to you the first time, cach word written on a 
card, presented separately and regularly. When all the 
words have been presented, I will go to the blackboard 
and say “Go.” If one of you remembers a word, he 
says it aloud, I will write it on the blackboard and the 
other members of the group as well as the ones who 
gave the original should discuss this and other possi- 
bilities, until a group accord is reached. The others 
should express their opinion as to the exact form of the 
word. Only one word will be adopted to represent 
the group. Unanimous agreement is not necessary. It 
is necessary that you ail contribute; the order with 
which you intervene does not matter. When you have 
reached a decision as a group, tell the recorder whether 
you wish to keep the word or not, before passing on to the 
next word. If after 3% no new word has 
been presented, the observer will say, “That's sufficient,” 
and then I will present the list a second time followed 
by a second recall in the same way as with the first 
recall and so on. 


s« conds 


The group was not told there were to be five 
recalls. 


Working Individually 

The instructions for the second part of the ex- 
periment were given after a 15-minute rest in 
which the group was asked not to discuss the 
experiment: 


The purpose of the second part of this experiment is 
to see if you can learn a list of 19 two-syllable nonsense 
words working alone: we are interested in your results 
as individuals, and not the product of any kind of col 
laboration. It is necessary then that during your work 
you do not display any feelings of success or failure, and 
if you have the feeling of having learned the list you 
continue to work like your neighbors. It is necessary 
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that you make the maximum effort to remember the 
words as quickly as possible, the order with which you 
remember is not important—this is how we shall pro- 
ceed. I will present a list a first time, each word written 
on a card at a regular rhythm. When all the words 
have been presented, I will say, “Go,” and you are to 
write on the sheets of paper before you the first word 
in the list that comes to your mind. You are to note 
with as much care as possible the degree of certitude 
you have about each word: 9 for absolutely certain the 
word is correct; 5, some certainty, and 1 if you are 
quite uncertain but still believe you want to put the 
word down. It can also occur that you remember 
a word but you are not certain of it, and think of 
another that seems more likely. In this case, bar lightly 
the first word, and write below the one you believe to 
be better until you reach a certain degree of agreement 
with yourself. To mark the index of certitude, fold 
the paper so that the previous words are not visible, and 
proceed to the next word. After a time when nobody 
writes for 30 seconds, the experimenter will say, “That 
is sufficient,” and will present the lists again, followed 
by a second recall, etc. 
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worked first and then the individuals in the 
group worked separately (called G-I groups). 


Number of Nonsense Words Correctly Re- 
called Each Trial 

a. There is little or no difference between 
G-I and I-G groups with respect to the num- 
ber of nonsense words correctly recalled by a 
group or an individual during each trial (total 
score). This was the basis for combining the 
I-G and G-I groups. 

4. G-I individuals have an average total 
score significantly greater than I-G individ- 
uals. In other words, working in a group 
first significantly helps the individuals to 
learn an equivalent list later. 

c. The total scores of I-G groups average 


TABLE 1 
Number oF Nonsense Worps Correctty Recatrep Eacu Tria 
(Total score) 








I-G Group 





AVERAGE AVERAGE 
OF INpI- p* 
Groups VIDUAL 


AVERAGE OF 
Best INp1- 
VIDUAL 


TRIAL 


pe oF 


G-I Group 





AVERAGE OF 
Best INp1- 
VIDUAL 


AVERAGE AVERAGE 
INDI- 


Groups VIDUAL 





2.4 
7.3 


3 


5. 


9. 
-2 It. 





* Group vs. average individual. 
** Group vs. average of best individual. 


Again the number of trials (five) was not specified. 
It is to be noted that after the fifth trial of working 
as a group, there was an individual recall, and after 
the fifth trial of working individually, there was a 
group recall. 

The instructions were rarely misunderstood; the 
most common question, for about 30 per cent of 
the groups, was: “Can we organize so that A learns 
6, B 6, and C likewise?” E then said, “No, we are 
interested in your collaboration on each word. In 
the case you suggest, we would be measuring how 
well person A learned Azs six words, B his six, etc.” 
In all cases the group agreed and the experiment 
was continued. This, however, does not preclude 
the possibility of implicit organization. 

After the experiments, the groups were asked 
(a) how fatigued they had been, and (4) how they 
had enjoyed working in a group versus working in- 
dividually. 


Resutts AND Discussion 
In the statistical analysis, the 10 groups 
and 30 individuals who worked first as in- 
dividuals working separately and then as a 
group (to be called I-G groups) were com- 
pared with the 10 groups wherein the group 


higher than I-G individuals at the .o1 level of 
significance. The I-G group scores are also 
higher than the best total scores for individ- 
uals of each group. 

d. G-l individuals have a significantiy 
greater total score in the second, third, fourth, 
and fifth trials. The difference for the first 
trial is not statistically significant. Further- 
more, there is no statistical difference between 
the G-I groups and the best G-I individual 
of each group. In fact, in the first two trials, 
the best individual’s average is higher than 
the group average. (See Table 1.) 

Affective interrelations between the mern- 
bers were important determiners of the group 
product. Often correct words were rejected 
by members of the group when the proposer 
of the word was in disfavor with the others. 
On the other hand, the “popular” leader was 
able to maneuver the group to accept dis- 
torted, incorrect words. On a number of 
occasions groups spent an extremely long 
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time trying to reach a decision about the 
form of a word and, by doing this, forgot 
other words. 

Practically every group had such situations, 
some groups more than others. Where the 
intragroup problems were prominent, the 
superiority of the group product to the aver- 
age or best individual product is minimal. 


Group-Learned Product Curve 


The curve of group-learned products is seen 
to resemble greatly the curve of individual 
learning (Fig. 1). This may be interpreted 


Yoyo 
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f 
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Fic. 1. Averace Curves oF Group-LeaRNED 
Propucts AND INpIVIDUAL LEARNING 


as a point in favor of establishing a more firm 
analogy between individual learning and 
group learning, as well as giving some weight 
to the use of the concept of “group learning 


curve.” The increased acceleration of the 
curve of group-learned products as compared 
to the curve of individual learning is a char- 
acteristic of group learning. But it is well to 
emphasize the conditional superiority of the 
group to the individual. One glance at the 
curve of the best individuals of the G-I groups 
proves how individuals, with some previous 
training in a group, can perform as well 
as groups. This has not been reported previ- 
ously, but it is evident that there are many 
instances in life where individuals working 
separately can produce detter than a group of 
persons. In these experiments, in 60 per cent 
of the G-I groups there were individuals who 
surpassed the group at least three times when 
working separately. There were cases where 
all three individuals, when working sepa- 
rately, produced more than the group did 
during that particular trial. Among the I-G 
groups, there was one case in which an indi- 
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vidual performed better than the group at 
least three times. 


Time Required 

a. There is little difference between groups 
(G-I and I-G combined) and individuals with 
respect to the time required per trial (see 
Table 2). 

b. The average time of all the groups is sig- 
nificantly greater in the first three trials, but 
not in the last two (see Table 2). 


Rate of Recall 

a. There is no statistical difference between 
G-I and I-G groups with respect to the rate 
of recall. Rate of recall is defined as the total 
score on a particular trial divided by the time 
required for that trial. 

5. G-I individuals have a_ significantly 
greater rate of recall than the I-G individuals. 

c. I-G groups have a greater recall rate than 
the I-G individuals (on the average) for all 
trials. 

d. G-I individuals have a greater recall 
rate than G-I groups at the first trial. The 
differences for the second and third trials are 
not statistically significant. By the fourth and 
fifth trials, the direction of superiority 
changes, and the G-I groups surpass the G-I 
individuals (Table 3). 


TABLE 2 


Time For A TRIAL 








AVERAGE FOR 
I-G anv G-I 
Groups 
CoMBINED 


AVERAGE 


TRIAL . nent 
INDIVIDUAL 





“I 
5.0 


si0 





* Average time for slowest individual. 


These results indicate that, although groups 
require more time than individuals during 
the early trials, during the later trials they not 
only require equivalent time per trial but re- 
call more correct words than do individuals. 
This rate of recall is one indication of the 
superiority of the group procedure. The 
change in recall rate from the first to the 
fifth trial is interpreted as an increase in the 
efficiency of functioning as a group. The 
group, from the point of view of total score, 
resembles the individual more during the 
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TABLE 3 
Rate or Recat * 








I-G Group 


AVERAGE 
INDIVIDUAL 


AVERAGE 
or GROUPS 


G-I Group 





AVERAGE 
INDIVIDUAL 


AVERAGE 
or GrouPs 





21 -78 
-97 1.57 
-92 2.1 

-48 2.57 
3.57 2.91 


- 86 1.33 
2.48 2.18 
3-42 2.80 
8.35 3-19 +02 
5-35 3-25 +02 








early trials. With respect to total time re- 
quired for recall during the trial, the group 
seems to resemble the individual more in the 
later stages.” 

But the superiority of the rate of recall of 
the group is further supported by the calcula- 
tions and qualitative observations on the fre- 
quency with which the group adopts defor- 
mations of correct words. The group sifts out 
these incorrect responses, primarily because 
there is a series of rejections and evaluations 
of each word before the decision to adopt the 
word is taken. This corresponds to Shaw’s 
findings in her work on group problem- 
solving (7). The “critical” forces within a 
group are greater on the average than the 
“critical” forces within the individual. The 
individual is usually less reluctant to put 
down a deformation of a correct word. 


2It is important to note that while the time for 
individuals has been based on the slowest individual in 
the group, because of the difficulty in determining when 
a particular person finished, the difference in time 
required for the fastest and slowest individual is very 
small—probably never more than 30 sec. The slow 
individuals tended to stop working when the others 
did. This “facilitative” effect has been noted in other 
studies wherein individuals work individually but in 
the presence of others. The significance of the results 
must, however, be restricted to the extent that this 
variation makes a difference in the calculation of the 
individual rate of recall. 


*The number of nonsense words correctly recalled for each trial divided by the time required for that trial. 


Invented Words 


a. The slight difference between G-I and 
I-G groups with respect to the number of 
invented words is also statistically not sig- 
nificant. 

6. G-I individuals invent significantly fewer 
words than I-G individuals (working sepa- 
rately). 

¢. I-G individuals invent significantly more 
words than I-G groups in all five trials. Dur- 
ing the first trial there is no statistical differ- 
ence between G-I groups and G-I individ- 
uals working separately, but in the last four 
trials the G-I individuals invent significantly 
more words (Table 4). 

This means that under some conditions a 
group invented many more words than a par- 
ticular individual in the group. The process 
of inventing a word has to be seen in terms 
of its dynamic meaning to the group. One 
group, for example, adopted a word Zevut in 
the third trial as being a word in the list pre- 
sented. After the third trial, a presentation 
was made in which the correct word, Zuvat, 
was presented. Nevertheless, in the fourth 
trial and the fifth trial, Zevut was unani- 
mously adopted by the group. It is inter- 
esting to note that it was not always suggested 
by the same member of the group. This 


TABLE 4 


INVENTIONS 


I-G Group 


AVERAGE 
INDIVIDUAL 


AVERAGE 
or GROUPS 


G-I Group 





AVERAGE 
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“hallucination” should be viewed in its con- 
text as an attempt to establish a common 
product, irrespective of the objective reality. 
It appears to have reflected, in part, a need 
in the group to have agreement-on-something. 
It appeared in a group that had experienced 
a number of failures in functioning as a 
group. The reality-orientation of the group 
was relaxed in order that this agreement-on- 
something could be reached. This interpreta- 
tion is posed as a hypothesis relative to the 
problem of understanding psychologically the 
conditions under which a group makes 
errors, or is “objective.” 


Repetitions 


a. The differences between I-G and G-I 
groups and individuals with respect to the 
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been an agreement-for-agreement’s sake, de- 
signed to speed up the recall process. In a 
competitively oriented group, however, where 
the “personal” needs of the members are 
more important than “group” needs, there is 
a need to contribute a word, and from the 
point of view of the contributor, the word 
presented may be a “new” word since he did 
not “hear,” psychologically, its presentation 
earlier. The number of repetitions made, it is 
believed, can often be seen as an index of the 
lack of psychological communication between 
the parts, a clue as to the degree of functional 
dependence between the members of the 
group. 

But it is of utmost importance to note that 
these were repetitions adopted by the group 
and hence reflect properties of the group-qua- 


TABLE 5 
REPETITIONS 








I-G Group 
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AVERAGE 
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AVERAGE 
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number of words repeated during a single 
trial are not statistically significant. 

&. During the first two trials there is no 
difference between the average number of 
repetitions of the groups and those by the in- 
dividuals working separately. But during 
the last three trials, the group repeats sig- 
nificantly more. (See Table 5.) 

The phenomenon of “repetition” seen from 
the point of view of the group and the indi- 
vidual-working-separately is a “forgetting” 
of a previously adopted decision. The task in 
this experiment required that a list be learned, 
and also that in each trial the groups (and 
individuals) know what they have said dur- 
ing that trial, since they do not see the words 
they have adopted during a given trial. (See 
procedure section above.) It is likely that in 
many cases the group “automatically” adopted 
a word suggested by member A, and that 
later on member B suggested the same word. 
B’s previous agreement with the word when 
suggested by A for the first time may have 


group, as well as the specific interpersonal 
relations between members of the group, or 
the personal motivations of a particular in- 
dividual in the group. Future experimen- 
tation in which control of organization 
and dynamics of a group is made should at- 
tempt to determine conditions under which 
a group invents or repeats more or less, 
Evidences of “Implicit” Organization 

A number of groups performed inefh- 
ciently because the activity of learning the 
nonsense words was not organized. In all 
cases, the groups had no predetermined or- 
ganization to the extent that there were no 
defined roles or systems of learning as a 
group. The most efficient groups, however, 
showed some kind of organization. The 
term “organization” is rather crudely used 
here. It refers to the extent to which there 
was efficient use of the roles necessary to 
come up with a group agreement. Members 
have to suggest, clarify, stimulate, and limit 











768 


discussion, and call for a “group decision.” 
Inefficiencies in this procedural aspect of the 
recall limited to some extent the total score 
of the group. 

Sometimes during the recall one member 
of the group presented the same word on each 
trial. This “assignment” of words to par- 
ticular parts of the group was probably un- 
conscious. Most groups professed not to 
have organized among themselves, and no 
fixed assignment of several words in the list 
was noted. Each member of the group did 
try with various degrees of effort to learn 
each word, but from the point of view of the 
individual member, he could relax his atten- 
tion when a word usually given by another 
was presented. Most groups reported, for 
example, that the group task was less fatigu- 
ing, even when it came in the second part 
of the experiment. 


Individual Recall After Group Recall 


For all groups, an individual recall was 
taken after the fifth group-qua-group recall. 
These data shed some further light on the 
degree of implicit organization and the extent 
to which one member of the group carried 
the burden of the group recall. 

Generally, there was a great degree of over- 
lap in the words remembered among the 
members of the group. While one or two 
words were given special attention by par- 
ticular individuals, there was general collab- 
oration on the other words. The average 
difference between the best and the worst 
individual recalls was 5.1 words. 

In only two groups did an individual, dur- 
ing the individual recall after the fifth trial 
of the group-qua-group recall, reach the 
group score. No individual surpassed the 
group score under these conditions. 

In addition, the theoretical potentiality of 
the group was determined by a summation 
of the number of different words learned by 
the three different individuals during the in- 
dividual-working-alone part of the experi- 
ments. These results are not clear. There 
were several groups that could have pro- 
duced all 19 words if they had pooled their 
individual resources. ‘There were several 
others who actually learned 19 words while 
working as a group, but could not have pro- 
duced 19 words or their actual group score, 
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even if they had added the different words 
each learned while working as an individual. 
Their actual performance exceeded their po- 
tentiality. 


The “Method of Agreement” 


The fruitfulness of the “method of agree- 
ment” in experiments on learning is at least 
indirectly measured by the results of this ex- 
periment. Furthermore, the instructions 
seem to have a clear psychological significance 
to the groups as well as to the individuals 
working separately. The groups understood 
what was meant by working as a group and 
discussing among themselves until one prod- 
uct, representative of the group as a group, 
was reached. The participation of each 
member, whether overt or covert, was noted. 
But there were differences in the amount of 
agreement that was satisfactory for the group. 
This was also noted in the individual self- 
ratings when the individuals worked sepa- 
rately. There were group as well as individ- 
ual differences in the degree of certitude 
about words included in each recall. It is 
possible that the method of agreement sets 
up analogous “internal situations” for the 
group considered as a group and the in- 
dividual considered as an individual. 


Group Fatigue 


It was noted in the cases of groups who 
worked as individuals first and as a group 
second (I-G groups) that there were few 
instances of fatigue (except in those groups 
where interpersonal relations were rather 
strained). The groups reported that they 
were clearly more fatigued in the first part of 
the experiment than in the second, even 
though during the second part of the experi- 
ment they were asked to learn another list 
of two-syllable nonsense words. Future re- 
search could point up why a group fatigues 
less quickly and why, in a few cases, satiation 
in the group, once begun, was extremely 
rapid in “snowballing” to such a point that 
group interest in the task became almost non- 
existent. 


“Group Intelligence” 


The group-learned product curve is often 
identical to the learned product curve of a 
poorer member of another group. Clearly 
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there are “low intelligence” groups and “high 
intelligence” individuals. One adult can solve 
a problem that thirty children could not even 
comprehend. Within the adult groups as 
well, it would be difficult to derive the group 
“intelligence” from the summation of indi- 
vidual intelligence scores. This is a short- 
coming of this research, and of any other 
that does not control group-qua-group intelli- 
gence. It is suggested that a group intelli- 
gence test could be constructed using the 
method of agreement. Until this is done, we 
are not certain that a group performs well 
because it is inefficient but very intelligent 
as a group, or because it is quite unintelli- 
gent but very efficient in employing its po- 
tentialities. 


SUMMARY 


Twenty three-person groups of French- 
speaking students at the Sorbonne were given 
the task of learning two equivalent lists 
of two-syllable nonsense words, working sep- 
arately but in the presence of the others, and 
working together as a group with a common 
goal. 

Variation in the order of administration, 
i.e., whether the individuals first worked sep- 
arately or first worked in a group, produced 
differential results: the individuals who 
worked in the group first had significantly 
higher total scores and a better rate of recall 
than did the individuals who worked sepa- 
rately first. These individuals also invented 
fewer words. There are no statistical differ- 
ences between individuals who worked first 
and those who worked second with respect 
to time required per trial, or between groups 
who worked first or second with respect to 
all dimensions (total score, time, rate of re- 
call, inventions, and repetitions). It was con- 
cluded that group life can contribute to in- 
dividual learning of the same kind of task 
later. 

The curve of group-learned products was 
determined. It is a smooth curve that re- 
sembles the curve of individual learning, but 
is more positively accelerated. In particular 
cases, a group-learned product curve coin- 
cided with an individual learning curve. 

Groups were found to be superior to indi- 
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viduals working separately but in the pres 
ence of others under special conditions. The 
group is superior to the average individual 
in total score but is not significantly different 
from the average of the best individuals in 
the groups, if the best individuals have previ- 
ously worked in a group on a similar task. In 
seven groups out of 20 instances, there were 
individuals who surpassed or equalled the 
performance of the group. 

Groups require more time to recall during 
early trials but have a higher over-all recall 
rate than individuals. 

Under some conditions, a group makes 
more errors by repeating more words during 
a particular trial than individuals do, but in- 
dividuals on the average invent or deform 
more words than groups. 

The conditional superiority of a group is 
stressed along all dimensions. It is difficult 
to predict whether groups in particular cases 
will be superior to particular individuals, will 
make less errors, or require more or less time. 
Some of the conditions under which a group 
may be less efficient were discussed, and 
analogies were drawn between individual in- 
ternal processes and group external processes. 
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aggression theorists that aggressive tend- 

encies may be aroused by the blocking of 
any goal-directed behavior. This general 
hypothesis and numerous related hypotheses 
have been explored in considerable detail (2, 
3, 8). The present study is concerned in a 
restricted way with the “fate” of aggressive 
tendencies (or hostile impulses) once they 
have been aroused. Generally these tend- 
encies or impulses may be either inhibited or 
discharged as overt aggression. If discharge 
o¢curs then presumably the level of hostile 
impulse is thereby lowered. In the language 
of the frustration-aggression theorists, “. . . the 
occurrence of any act of aggression is assumed 
to reduce the instigation to aggression. In 
psychoanalytic terminology, such a release is 
called catharsis” (2, p. 50). 

Current interest in social communication 
(4), particularly in the communication of 
emotions, has led to a renewed concern with 
hypotheses of the “catharsis” variety. In the 
context of social communication the “cathar- 
sis” hypothesis is stated as follows: the 
communication of hostility through overt 
aggressive behavior directed toward a (per- 
sonal) instigator will tend to reduce the resid- 
ual hostility toward that instigator. The 
present study is designed to test this hypoth- 
esis. 

The immediate antecedent of this study is 
some work done by one of the present authors 
(11) on the cohesiveness of low-status groups. 
There it appeared that the “catharsis” hypoth- 
esis was confirmed. However, in this earlier 
study, as well as in a study by French (5), 
insufficient control and standardization made 
the results somewhat equivocal. The present 
experiment was designed to overcome these 
difficulties. 

A secondary purpose of the present study 


I is the basic contention of the frustration- 


1 The research reported in this paper was made pos- 
sible by a grant from the Graduate School of Boston 
University. 

2 Now at Harvard University. 


was to examine the restriction imposed on 
the generality of the “catharsis” hypothesis 
by the “autistic hostility” hypothesis of New- 
comb (9). This hypothesis states that a hos- 
tile attitude is protected from change by 
leading to a breakdown in the communica- 
tion necessary for the alteration of the atti- 
tude. The relevance of the application of this 
hypothesis will become clearer in the discus- 
sion of the experimental results. At this point 
it is sufficient to suggest that the Newcomb 
hypothesis implies the following: there is a 
positive relationship between degree of hostil- 
ity and relative effectiveness of inhibition of 
overt aggression. Thus, even without exter- 


nal barriers to aggression, it is assumed that 
the amount of overt aggressive discharge is 
not a direct function of the degree of hos- 
tility. 


EXPERIMENT | 


Subjects 


The subjects (Ss) were 41 volunteer male under- 
graduate students recruited from an elementary psy- 
chology course at Boston University. 

The Ss were scheduled to appear individually at 
the experimental rooms. On arrival each S was in- 
troduced to two paid participants who were students 
at a neighboring division of Boston University. The 
roles of the paid participants were standardized and 
well-rehearsed in advance. They pretended to be 
maive as to the purpose of the study and unac- 
quainted with one another. 


Procedure 


The experimenter (FE), in his preliminary instruc- 
tions, reminded Ss of the sentence completion test 
that they had completed at time of recruitment. He 
was interested in discovering, he said, whether un- 
dergraduate students could not perhaps learn as 
much about one another’s personalities by direct 
communication as could professional psychologists 
working from test materials. This served as the 
pretext of the experiment. 

At this point E designated the naive S as one 
“diagnoser” and, apparently arbitrarily, selected one 
of the paid participants (always the same one) as 
the other “diagnoser.” The remaining paid partici- 
pant was assigned the job of messenger. The two 
diagnosers were to communicate to one another 
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about themselves and the messenger was to deliver 
the communications. 

Each diagnoser was given his own small room in 
which to work. The two rooms were adjacent and 
separated by a large glass window. The diagnosers 
were seated at tables facing the window and hence 
each was in constant view of the other. 

Before proceeding with the main business of com- 
municating to one another, each diagnoser was in- 
structed to write an initial “personality sketch,” 
consisting of his first impressions of the other 
diagnoser. (Since these were to be seen only by E, 
the paid participant needed only to pretend he was 
writing a sketch.) 

This much accomplished, Ss began the communi- 
cation process, on which two restrictions were im- 
posed: all communication must be in the form of 
written notes and the content of communication 
must refer to self. The exact instructions given 
were as follows: 


We would like to have you go on now and write 
notes to each other. You can write as many notes as 
you want to. Here is some paper you can use. Write 
about yourself, telling the other fellow whatever you 
want to about your personality, your ambitions, your 
likes and dislikes, and.so on. But be sure you write 
only about yourself, telling as much as you can, so that 
at the end the other fellow will be able to write a 
better personality sketch of you. When you've finished 
writing a note, rap on the table and the messenger 
will come in and deliver it. As soon as you receive a 
note read it right away. Don’t let the notes pile up. 


In acting as second diagnoser, the paid participant 
initiated a standard set of eight notes. These were 
sent at intervals of two minutes. The first seven 
of these were intended to characterize him as arro- 
gant and opinionated but not “strong” enough to 
inhibit the S’s aggression. Two of the notes will 
illustrate the effect intended: 

Note 3: I know I'll succeed in my profession, be- 
cause I am so far above everybody else in my class. 
I got 98 on my last biology exam and that’s pretty 
good. I am the best student in the lab course also. 

Note 7: Since I am kind of weak, I haven’t been 
good in any sports—but that’s just kid stuff, I want 
to be a man. I am sure if I wanted to be a good 
athlete I would be very successful even though I 
don’t care much for physical contact of that sort. 

The eighth note was calculated to act as the 
instigation to hostility. In this note the paid par- 
ticipant (referred to as the “instigator” in the next 
section) now ignored the instructions to write about 
self and sent off a strong denunciation of the S. 
The precise wording of the note was as follows: 
“You know, you're so full of bull , that I’m 
not going to talk about myself anymore. You're the 
most egotistical, deceitful /iar I’ve ever seen. How 
anybody can fake about his own abilities as much as 
you have I'll never know. I don’t think you're 
the type of guy I'd care to associate with. Why 
don’t you play it straight, fellow?” 

Naive Ss were assigned at random to one of two 
experimental treatments. Twenty of the Ss (No- 
Com group) were not permitted to communicate 
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after receiving the instigation (eighth note from 
the paid participant). As soon as the No-Com § 
was seen to have finished reading the instigation, 
E entered the room and announced that the allotted 
time was up. The remaining 21 Ss (Com group) 
were permitted to write one additional note after the 
instigation. As soon as this had been delivered, E 
terminated the proceedings. 

At the conclusion of the communication period, 
each of the two diagnosers wrote a final personality 
sketch of his partner (again the paid participant 
merely pretended). When this was finished, E in- 
formally interviewed the naive S, while a second E 
pretended to be similarly engaged with the paid 
participant. 

Finally, before each S was dismissed, the nature 
of the experiment was fully explained to him and 
the paid participants were reintroduced to him, with 
apologies and a request for cessation of hostilities. 


Copinc ProcepuRE 


In treating the data, content analyses were 
done separately for the notes communicated 
by the naive Ss and for their pre-experimental 
and post-experimental “personality sketches” 
of the paid participant (instigator). 

For the notes the content analysis was made 
in accordance with four general categories, 
some of which were further subdivided. 
These categories provided the measures of 
overt aggression. 


A. Instigator as Referent 

I. Direct aggression: includes any wish, fantasy, 
or intention to commit a destructive act; attribu- 
tion of undesirable property or denial of desirable 
property; direct expression of hostile feeling. 

Il. Indirect aggression: includes negative model- 
ing and strong disagreement. 

Ill. Affective neutrality. 

IV. Friendly statements. 
B. Other Person, Group, or Institution as Referent 

Differs from A only in referent. The same sub- 
categories as above were used. 
C. Self as Referent 


__ I. Self augmentation (SA): attribution to self of 
importance, acclaim, achievement, virtue, etc. 
_ IL Self reduction (SR): disparages, condemns, 
inculpates self. : 

III. Self neutral (SN): residual category. 
D. General and Miscellaneous 

Statements without personal reference. 


In coding the notes, each sentence was 
taken as a unit. When a change in category 
occurred within a sentence both categories 
were scored, each as a separate unit. Two in- 
dependent coders obtained an agreement of 
72 per cent for all the categories, including 
subdivisions. 
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The code categories used in analyzing the 
personality sketches were similar to those for 
the notes. Analyses of the sketches provided 
the pre-experimental and _post-experimental 
measures of hostility and friendliness toward 
the instigator. In brief, the categories in- 
cluded the following: 

Hostile statements: includes expression of 
hostile feeling toward the instigator, attribu- 
tion of unfavorable characteristics, negative 
modelling. 

Friendly statements: includes all reference 
to instigator as object to be admired, liked, 
modelled; attribution of favorable characteris- 
tics. 
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centages for each of the two experimental 
groups. Means for percentages of friendly 
and of neutral units were obtained in the 
same way. Table 1 summarizes these data. 
It is clear that the two experimental groups 
are reasonably well equated with respect to 
mean percentages of hostile, friendly, and 
neutral units in their pre-experimental “per- 
sonality sketches.” The post-experimental 
sketches, however, begin to show differences 
between the groups. It will be noticed that 
for both groups the percentage of hostile 
units increases markedly (as we would expect 
if the instigation has been effective) and that 
friendly units tend to disappear. In evaluat- 


TABLE 1 
MEAN PERCENTAGES OF THE CoNTENT CATEGORIES APPEARING IN THE Pre- AND Post-EXPERIMENTAL 
“PERSONALITY SKETCHES” OF SUBJECTS IN THE Two EXPERIMENTAL Groups 





Pre-ExpeRIMENTAL SKETCHES 


M 





Post-ExPERIMENTAL SKETCHES 





SD } SD 





No-C 

Hostile 

Friendly 

Neutral plus General 
Com (N=21) 

Hostile 

Friendly 

Neutral plus General 





Neutral statements: includes all nonaffec- 
tive references to instigator. 

General and miscellaneous statements: in- 
cludes all comments having no apparent ref- 
ence to the instigator. 

The conventions for units and double cod- 
ing were the same as in coding the notes. 
Two independent coders obtained an agree- 
ment of 75 per cent in coding this material. 


RESULTS 


The major hypothesis to be tested was as 
follows: The communication of hostility to 
an instigator will tend to reduce the residual 
hostility toward that instigator. 

Let us see whether the results of the ex- 
periment give support to this supposition. 
For each S’s pre-experimental “personality 
sketch” (and then separately for his post-ex- 
perimental “sketch”), we computed the per- 
centage of his total units that were hostile. 
We then computed the means of these per- 


ing the magnitude of change we shall focus 
separately on the hostile and the friendly 
units. In accordance with our major hypoth- 
esis, we predicted that the mean change in 
percentage of hostile units would be sig- 
nificantly greater in the No-Com than in the 
Com group. Conversely, we predicted that 
the reduction in the percentage of friendly 
units would be significantly greater in the 
No-Com than in the Com group. Analysis 
of the data shows that, although the trend 
is in the predicted direction, the mean per- 
centage increase in hostile units is not sig- 
nificantly different in the two groups. How- 
ever, the data for friendly units do show the 
No-Com group to change significantly more 
sharply toward fewer units than does the Com 
group. The difference between pre- and post- 
experimental sketches is 38.9 for the No-Com 
group, while for the Com group the difference 
is only 29.2. The ¢ obtained between these 
two mean changes was 1.67, which just 
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reaches significance at the 5 per cent level. 
It is possible that relative frequency of 
friendly units may be a more sensitive meas- 
ure of affective changes than frequency of 
hostile units since the range of the latter 
may be somewhat restricted, particularly in a 
college population, by taboos against the ex- 
pression of more than a moderate amount 
of hostility. Hence, we may perhaps assume 
that friendly units are measuring (better) the 
same affective states as hostile units. 

To the extent that our friendiiness measure 
is acceptable the hypothesis appears to be con- 
firmed. It should be noted, however, that in 
a strict sense we have not demonstrated the 
“catharsis” phenomenon, since from our data 
we cannot distinguish between the following 
two alternative explanations: impossibility of 
communication has heightened the hostility 
in the No-Com group; or communication 
directed toward the instigator has resulted in 
a reduced hostility in the Com group. 

The possibility that the thwarting of com- 
munication has increased hostility in the No- 
Com group is predicted by the authors of 
Frustration and Aggression (2, p. 53) and in 
a more general statement by Whiting.* That 
this mechanism may be operating in the pres- 
ent situation is demonstrated by a second 
experiment that we have done in this area. 


ExperimMENtT II 


This experiment was similar in all respects to 
Experiment I up to and through the instigation. At 
that point the No-Com Ss were interrupted by E, 
who talked to them for about three minutes in an 
informal matter-of-fact manner. The Com Ss were 
permitted to write one note subsequent to the insti- 
gation before the informal interruption occurred. 
Then Ss in both experimental treatments were per- 
mitted to continue communication with the insti- 
gator, who initiated an additional seven notes in a 
“neutral” vein. Units of aggression directed toward 
the instigator during the total period after instiga- 
tion were counted for all Ss. The count included 
units contributed by the Com Ss in the note immedi- 
ately after instigation. 


RESULTS 


The data for 18 Ss (9 in each experimental 
treatment) show a higher total volume of ag- 


8 Since the direction of the difference was predicted 
in advance, a one-tailed test of significance was used. 
With 39 degrees of freedom a ¢ of 1.65 is required for 
significance at the .05 level. 

4Wuirtnc, J. W. M. A brief theory of purposive 
acquired drives. Unpublished manuscript, 1948. 
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gression for the No-Com group. Mean ag- 
gression of the No-Com Ss is 2.69 units, while 
for the Com Ss the mean is only 1.36 units. 
In testing the magnitude of this difference 
with 16 degrees of freedom a ¢ of 1.75 is 
required for significance at the 5 per cent 
level.5 The obtained ¢ of 1.73 thus closely 
approaches significance. The implication 
seems to be that to the extent that this differ- 
ence is reliable, the thwarting of communica- 
tion back to the instigator immediately after 
instigation increases the level of hostility, in 
spite of the fact that the instigator is not re- 
sponsible for the interruption. In terms of 
our major experiment, described earlier, the 
difference between the two experimental 
groups in resultant friendliness may be at- 
tributable entirely to this phenomenon of in- 
creased hostility in the No-Com group, 
though we have not demonstrated that the 
alternative principle is not also operating; i.e., 
communication may also be reducing the 
level of hostility in the Com group. The lat- 
ter possibility certainly cannot easily be dis- 
missed in the light of considerable direct and 
indirect evidence in other areas of study that 


consummatory behavior appears to be accom- 
panied by reduction of tension. (See, for 
example, the Lewinian studies in level of 
aspiration [7]; for more direct evidence, the 
study of Child and Whiting [1].) In subse- 
quent discussion we will mean by the term 
“catharsis-conflict effect” the general process 


whereby communication after instigation 
leads to relatively reduced hostility (or rela- 
tively increased friendliness) where we can- 
not specify for sure the mechanism. 


Discussion 
Oualitative Aspects of Communication 


Let us now return to the main experiment, 
described earlier. The upper part of Table 2 
shows the mean percentages of units of com- 
munication in the various content categories 
during the period prior to definite instigation. 
Note that the two experimental groups appear 
to be fairly well equated before the experi- 
mental variable was introduced. The lower 
part of Table 2 shows the way in which the 

5 Since our pre-experimental prediction was that the 
thwarting of communication immediately after insti- 


gation will increase the level of hostile tension, a one- 
tailed test of significance was again used. 
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Com group distributed its communication 
units in the notes sent just after instigation. 
Comparison of the two sections of Table 2 re- 
veals a marked shift after instigation toward 
aggression directed to the instigator and a 
concomitant reduction in neutral references 
to self. 

Anticipating a variable that we will study 
more fully later, let us look at the relation- 
ship between initial disposition * toward the 
instigator and the qualitative aspects of the 
notes written before instigation. As a result 
of poor distribution in some of the process 
categories (A and D in Table 2), only four 
of the categories could be analyzed. Of these 
four categories only self-reduction (SR) 
proved to be significantly related to initial 
disposition (hostility-friendliness). The biser- 
ial correlation * obtained was +-.43, which for 
41 cases is significant at beyond the .05 
level. The more initial hostility toward the 
instigator the more self-reduction (or “intro- 
jection” of hostility) in the pre-instigation 
communications. 


Conflict and Aggression 


Now let us consider some of the additional 
differences between the two experimental 
groups. The following question was asked at 
the end of the experiment: “Do you feel that 
you would like to continue with this note- 
writing?” Forty per cent of the No-Com Ss 
were opposed to continuing, whereas only 
5 per cent of the Com Ss were opposed. This 
difference is significant at beyond the .or 
level.” In other words, virtually all of the 
Com Ss were fairly well content with the 
situation, while a large proportion of the 
No-Coms were unhappy in the situation. It 
is possible that this difference may be at- 
tributable to relatively strong conflict in the 
No-Com Ss. In these Ss a strong tendency 
to approach behavior (aggression) was effec- 
tively restrained by a combination of inter- 
nalized avoidance tendencies (the result of 
cultural pressure toward inhibition of ag- 

6 This measure is computed by subtracting the per- 
centage of friendly units from the percentage of hostile 
units in the pre-experimental “personality sketches.” 

7 The distribution of SR was such that dichotomiza- 
tion was 

8In this and in all tests of 
two-tailed are used, either no 


of the direction of the results was made or 
the prediction was ad hoc 


necessary 


} 


subsequently reported 


significance tests since 


prediction 


Joun W. Tuisaut anv Joun W. Cou es 


gression) and, more important, removal of 
the communication channel to the instigator. 
Avoidance of the entire situation might be a 
means of resolving the conflict, although, ad- 
mittedly, renewed attacks on the barrier to 
communication might equally well be ex- 
pected. 

A similar situation appears to exist when 
“willingness to continue” is analyzed within 
the No-Com group. (Since only one of the 
Com Ss expressed unwillingness to continue, 
we are not able to study the variable any 
further for this group.) When a measure of 
post-experimental residual hostility ° is related 
to willingness to continue in the task, a 
biserial correlation coefficient of —.47 is ob- 
tained. This relationship is significant at the 
05 level. Thus it appears that the higher 
the level of hostility in this kind of situation, 
the stronger is the desire to leave the field. 
This may be interpreted by assuming that as 
terminal (post-experimental) hostility in- 
creases, internalized tendencies to avoid ag- 
gression increase even more sharply. How- 
ever, since we have no good measure of inter- 
nalized tendency to avoid aggression, this 
interpretation cannot be evaluated in this 
study. 


Newcomb’s Autistic Hostility Hypothesis 


The foregoing discussion of the relationship 
between hostility and avoidance of interaction 
can be encompassed in a general way in the 
autistic hostility hypothesis of Newcomb (9). 
This hypothesis states that a hostile attitude 
will tend to perpetuate itself by leading to 


a breakdown in the communication that 
would be requisite to altering the attitude. 
The first statement that the Newcomb hy- 
pothesis permits us to make is the following: 

Initially hostile Ss, when compared with 
initially friendly Ss, will initiate a smaller 
volume of communication during the pre- 
instigation period. To test this hypothesis 
let us consider the relationship between in- 
itial disposition toward the instigator and 
numbers of units of communication up to the 
time of definite instigation. For this purpose 
the total group of 41 Ss can be used, since 
the experimental variable operated only after 

® This measure comes from subtracting the percentage 


of friendly units in the post-experimental “personality 
sketches” from the percentage of hostile units. 
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the point of instigation. The product- 
moment correlation obtained was —.32, which 
is significant at beyond the .05 level. This 
relationship is consistent with our expecta- 
tion that the initially hostile Ss will tend 
to reduce their volume of communication.” 
The Newcomb hypothesis appears also to 
imply the follow statement: 

The initially hostile Ss, when compared 
with the initially friendly, will communicate 
a smaller amount of aggression after instiga- 
tion. This relationship would be expected on 
the same grounds as the reduced volume of 
communication before instigation. We as- 
sume that the initially hostile S is in a rela- 
tively strong conflict situation about being 
overtly aggressive to the instigator and that 
this conflict is resolved by avoidance. If this 
is so, we would predict a lowered responsive- 
ness to the instigator both before and after 
instigation. The data appear to bear out this 
supposition. For the Com group, initial hos- 
tility-friendliness was related to the frequency 
of aggressive units communicated to the in- 
stigator ** in the note sent immediately after 
instigation. The biserial correlation (dicho- 
tomizing aggression) was —.56, which is sig- 
nificant at about the .o2 level. The sign of 
the correlation conforms to the expected re- 
lationship.’* 

Quite independently of the Newcomb 
hypothesis, we would suppose that in any 
case the initially hostile S, as compared with 
the initially friendly, would show less in- 
crease in hostility after instigation, since the 
stimulus value of the instigator will be per- 
ceived to have changed less. The initially 
hostile S regards the instigator as a “low type” 
from the beginning and is not surprised to 
see his opinion confirmed. 


10 Kelley (6) has reported confirmation of this 
hypothesis in some experiments done with groups of 
students in classroom settings. 

11 This measure includes only categories AI and AIl, 
ic., only categories referring to aggression directed 
toward the instigator. See Table 2. 

12 Hence it appears that the same Ss were initially 
hostile to the instigator, relatively nonaggressive after 
instigation, and self-reductive before instigation. It 
seems clear from our data, as well as from other sources, 
¢.g., Sears’s(10) investigations of overt and fantasy 
aggression in children, that in predicting presence or 
absence of aggressive behavior, the particular social 
stimulus conditions must be taken into account. Any 
attempt to treat intropunitiveness, extrapunitiveness, and 
impunitiveness as bases for distinct personality types 
must be qualified by the finding of intra-individual 
variation under different but closely related conditions. 
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Now let us consider the boundary condi- 
tions imposed on the catharsis-conflict effect 
by the Newcomb hypothesis. To the extent 
that increments in hostility as a result of in- 
stigation are a function of initial disposition 
toward the instigator, we would expect the 
catharsis-conflict effect to be dissipated when 
the data for the Com group alone are an- 
alyzed. No dissipation of the effect would 
be expected in comparisons of the two ex- 
perimental groups, since initial dispositions of 
the Ss are equated. But within the Com 
group, if those Ss who interact less with the 
instigator (i.e., the initially hostile) also gen- 
erate smaller increments of hostility in re- 
sponse to the instigation, we would not ex- 
pect to find the catharsis-conflict effect operat- 
ing. 

When in fact, for the Com group, either 
a frequency or an intensity measure of 
amount of post-instigation aggression (de- 
rived from the last note written) is correlated 
with any of the measures of residual post- 
experimental disposition toward the insti- 
gator (as measured by the terminal person- 
ality sketch), no correlation coefficient exceeds 
.20 in either direction. Hence, these data offer 
no support for the supposition that the cathar- 
sis-conflict effect operates within the Com 
group. 

Summarizing the main limiting effects 
which the Newcomb hypothesis will have on 
the generality of the catharsis-conflict effect 
in the present situation, we may say: 

i. In general, opportunity for communica- 
tion after instigation (when compared with 
the lack of this opportunity) should lead to 
relatively less shifting in the direction of 
unfriendliness toward the instigator. This 
we would expect from the general statement 
of the catharsis-conflict effect. 

2. The amount of change from before to 
after the experiment in disposition toward 
the instigator should be a positive function 
of the initial friendliness of S toward the 
instigator. Initially hostile Ss would be ex- 
pected to change less than initially friendly 
Ss. This expectation comes from two pos- 
sible mechanisms, both of which are likely 
to be operating in the present situation: (a) 
the reduced interaction and heightened psy- 
chological insulation against the instigator 
which, from the Newcomb hypothesis, we 
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would expect the initially hostile S to bring 
to the situation; and (4) the relatively smaller 
change in the stimulus value of the instigator 
from the point of view of the initially hos- 
tile S. 

3. The difference in tendency to shift 
toward unfriendliness (proposition 1, above), 
in the comparison of those communicating 
after iastigation, should operate most strongly 
for initially friendly Ss, and least strongly, if 
at all, for initially hostile Ss. This qualifica- 
tion of the catharsis-conflict effect comes di- 
rectly from the application of proposition 1 
to proposition 2. The catharsis-conflict effect 
will be expected to be least strong for those 
Ss most resistant to change. 


N FRIENDLINESS 


re 


CHANG 








FRIENDLY 
INITIAL DISPOSITION 


Fic. 1. A CoMPARISON OF THE Two EXPERIMENTAL 
Groups IN CHANGES IN FRIENDLINESS TOWARD 
INSTIGATOR AS A FUNCTION OF INITIAL 
DIsPosITION 


Changes in friendliness are measured by subtract- 
ing frequencies of friendly units in pre-experimental 
sketches from frequencies in post-cxperimental 
sketches (hence, negative sign indicates a change 
toward less friendliness). Initial disposition is meas- 
ured by subtracting friendly units from hostile units 
in pre-experimental sketches. The straight lines 
were fitted by the method of least squares. 


Figure 1 summarizes the data which per- 
mit us to evaluate the three propositions 


stated above. The units obtained from the 
personality sketches are here presented as ab- 
solute frequencies rather than as percentages 
of each S’s total units. In considering the 
changes from pre-experimental to post-experi- 
mental sketches, percentage measures possibly 
would have predisposed the data in the direc- 
tion of invariance for the initially hostile Ss, 
since if an S is initially 100 per cent hostile 
he can obviously get no more hostile. There- 
fore, to avoid the possibly spurious effects of a 
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ceiling on the scores, the data were recast into 
absolute frequency scores. 

It will be observed also that the ordinate 
scale is in terms of changes from pre-experi- 
mental to post-experimental sketches in fre- 
quencies of friendly units. As we have seen 
in Table 1 this measure appears to be more 
sensitive than is the hostility measure to dif- 
ferences created by the experimental vari- 
able."* 

Inspection of Fig. 1 shows good general 
conformity between the data and the proposi- 
tions stated above. 

Proposition 1: Catharsis-conflict effect. At 
every point on the abscissa the Com group 
shows less loss of friendliness than does the 
No-Com group. 

Proposition 2: Resistance to change in the 
initially hostile. Initially hostile Ss in both 
experimental variations show less change than 
do initially friendly Ss. 

Proposition 3: Dissipation of catharsis- 
conflict effect among initially hostile. The 
differences between Ss in the two experimen- 
tal treatments are greater for the initially 
friendly and are negligible for the initially 
hostile. 


SUMMARY 


The purpose of this experiment was to test 
the hypothesis that overt acts of aggression 
in response to instigation to hostility will 
tend to reduce the level of hostile tension in 
the aggressor. Twenty-one Ss were permitted 
to communicate back to an instigator im- 
mediately after instigation to hostility, while 


18 The data summarized in Fig. 1 possibly reflect to 
some extent a regression of post-experimental friendli- 
ness toward the mean, as a result of imperfect relia- 
bility of the measures used. This tendency probably 
does not seriously disturb our interpretation, in view of 
the nonrandom nature of the post-experimental data, 
i.c., the systematic differences between the two experi- 
mental groups and the restriction of changes in friendli- 
ness to negative values (confirming the supposition that 
the instigation produced hostility). 
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another 20 Ss were not permitted this final 
communication. The former Ss showed rela- 
tively more post-experimental friendliness 
toward the instigator, thus giving qualified 
confirmation to the hypothesis. 

A second experiment showed that the dif- 
ferences between the two groups in the first 
experiment may also be a function of rela- 
tively heightened hostility in the group whose 
communication is thwarted. 

The effects predicted by the autistic hos- 
tility hypothesis of Newcomb are shown to 
dissipate the effects predicted by our major 
hypothesis as a function of the degree of in- 
itial hostility of the S toward the instigator. 
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THE RELATION BETWEEN EXPRESSED ACCEPTANCE OF SELF 
AND EXPRESSED ACCEPTANCE OF OTHERS * 


EMANUEL M. BERGER 


Student Counseling Bureau, University of Minnesota 


otinc a study in which Sheerer (5) 
N found a correlation between attitudes 

of acceptance of the self and accept- 
ance of others, Rogers (4) has suggested that 
such a relationship would be of significance 
to social psychology. Such a relationship 
would imply that self-rejection may be a 
factor in individual hostility toward groups or 
toward other individuals. Fromm (2), for 
instance, sees the disproportionate hostility 
expressed against Jews in Germany as being 
related to a kind of cultural self-rejection 
which, in turn, derived from an authoritarian 
tradition of upbringing that tended to sup- 
press spontaneity. It appears that such a 


relationship might supply social psychology 
with a principle which would be helpful in 
understanding and explaining problems of 
social conflict and hostility. However, before 
it can be used in explanation, there is a 


need to test whether or not the generalization 
can be made with confidence. The existing 
evidence is limited. 

For the most part, observations of a rela- 
tionship between feelings toward the self and 
feelings toward others have been based on 
clinical experience alone, and only very re- 
cently have there been any attempts to study 
this relationship systematically. Alfred Ad- 
ler (1) was probably among the first to make 
such an observation when he contended that 
“a tendency to disparage” arose out of feel- 
ings of inferiority as an overcompensation. 
More recently, Horney (3) has stated that the 
person who does not believe himself lovable 
is unable to love others. According to Fromm 
(2) we should love ourselves, for self-love and 
the love of others go hand in hand. More- 
over, he proposes that a failure to love the 
self is accompanied by a basic hostility 
toward others which arises out of the sup- 
pression of the individual's spontaneity or of 
his “real” self. Other evidence regarding the 

1 This presented at Amer. Psychol. Ass., 


Chicago, 1951, and was abstracted from a doctoral thesis 
completed in 1950 at the Univer. of Pittsburgh. 
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relationship of attitude toward the self and at- 
titude toward others has come out of studies 
done at the University of Chicago under the 
influence of Carl R. Rogers. Rather than 
being generalizations from clinical experi- 
ence, these studies have been attempts to 
measure, by the use of rating scales, the atti- 
tudes of individuals toward themselves and 
others. The data have been statements made 
by individuals in recorded counseling ses- 
sions. Using the method described above, 
Sheerer (5) found a “definite and substantial 
correlation between attitudes of acceptance 
and respect for the self and attitudes of ac- 
ceptance and respect for others.” Part of 
a study by Stock (6) confirmed the results 
obtained by Sheerer. It should be noted, 
however, that seven of the same counseling 
cases were used in both studies. 

It appeared to us that there was a need 
to test this relationship with larger numbers 
of cases and more varied samples than had 
thus far been studied. The problem was thus 
twofold: 

1. To develop a group instrument for the 
measurement of self-acceptance and the ac- 
ceptance of others. 

2. To test the relationship between these 
variables in a variety of groups, using the 
developed instruments. 


METHOD 


Construction of Scales 


Definition of tke variables. The following defi- 
nitions are essentially those used by Sheerer (5), ex- 
cept that they have been abridged and slightly modi- 
fied. 

The self-accepting person: 

1. Relies primarily upon internalized values and 
standards rather than on external pressure as a guide 
for his behavior. 

2. Has faith in his capacity to cope with life. 

3. Assumes responsibility for and accepts the con- 
sequences of his own behavior. 

4. Accepts praise or criticism from others ob- 
jectively. 

5. Does not attempt to deny or distort any feel- 
ings, motives, limitations, abilities or favorable quali- 


778 





ACCEPTANCE OF SELF AND ACCEPTANCE OF OTHERS 


ties which he sees in himself, but rather accepts all 
without self-condemnation. 

6. Considers himself a person of worth on an 
equal plane with other persons. 

7. Does not expect others to reject him whether 
he gives them any reason to reject him or not. 

8. Does not regard himself as totally different 
from others, “queer,” or generally abnormal in his 
reactions. 

g. Is not shy or self-conscious. 


The person who is accepting of others: 

1. Does not reject, hate, or pass judgment against 
other persons when their behavior or standards seem 
to him to be contradictory to his own. 

2. Does not attempt to dominate others. 

3. Does not attempt to assume responsibility for 
others. 

4. Does not deny the worth of others or their 
equality as persons with him. This does not imply 
equality in regard to specific achievements. He feels 
neither above nor below: the people he meets. 

5. Shows a desire to serve others. 

6. Takes an active interest in others and shows 
a desire to create mutually satisfactory relations with 
them. 

7. In attempting to advance his own welfare, he 
is careful not to infringe on the rights of others. 


Selection of ltems 


Using the various elements that made up the defi- 
nitions as a guide, statements about the self and 
others were selected? or constructed so that they 
conformed to these elements. 

It was decided that four items would be required 
for each element. This would be desirable in order 
to have the number of items usually necessary to 
give satisfactory reliability. Also, it permitted the 
matching of two halves of the scale for the pur- 
pose of obtaining matched-half reliability. 


The Preliminary Scales 


The preliminary scales consisted of 47 statements 
on self-acceptance, 40 on acceptance of others. The 
statements were modified where necessary so that 
strong agreement with them sometimes received a 
high score, sometimes a low score. Items from both 
scales were mixed together on the same form. 

The two scales together may be considered a 
“form.” At the top of each form were the directions 
which briefly stated the purpose of the test in a 
general fashion, told the subject (S) how to mark 
the I.B.M. answer sheet when responding to the 
scales, and emphasized that S need not put down 
his name, thereby encouraging him to answer as 
honestly as possible. 

Regarding the directions for marking the answer 
sheet, S was directed to blacken space one following 
an item number, if the statement on the test form 
was “not at all true of himself,” space five if it 
was “true of himself.” The intervening spaces rep- 


2 A tota! of 30 items was selected for the preliminary 
scales from a list of self-references collected for study 
by Margaret Hartley at the Univer. of Chicago. 
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resented intermediate degrees to which the state- 
ment was true for the individual. 

The scales are essentially self-administering since 
all the directions needed to respond to them are 
on each form. Each S had as much time as he 
wished to respond to the scales. 

Scores for any item ranged from one to five. If 
a response “true of myself” (space five) indicated 
high acceptance of self or others, then that response 
received a score of five. The response “mostly true 
of myself” (space four) received a score of four for 
that item, with three, two, and one representing 
lesser degrees of acceptance of self or others. When 
the response “true of myself” indicated low accept- 
ance of self or others, that response received a score 
of one, with spaces four, three, two and one receiv- 
ing higher scores. The definitions were the basis for 
determining which response indicated high or low 
acceptance of self or others. 

The answer sheets were scored by I.B.M. machine, 
using keys prepared for the purpose of giving half 
scores on each scale. 

An individual's score on a scale was his total score 
for all items on that scale. 


Item Analysis 


The preliminary scales were administered and 
scored for 200 Ss who were students in first-year 
sociology or psychology. Such classes are usually 
made up of students with widely different socio- 
economic backgrounds and vocational interests. 
Their ages ranged from 17 to 45. However, about 
go per cent of them were in the 17 to 3 age 
group. 

In doing the item analysis, those Ss whose total 
scores were in the top 25 per cent were compared 
on each item with those whose total scores were 
in the bottom 25 per cent. Thus there were 50 in 
each criterion group. The difference between the 
mean scores of the criterion groups was computed 
for each item. This difference was used as an index 
of the discriminating power of the item. The 
standard error of the difference of the quartile 
means did not exceed .30 for any item. All items 
used in the final form of the scales except three had 
critical ratios of 3.00 or more. The three had critical 
ratios close to 2.00. 


The Final Form of the Scales 


The final selection of items was made on the basis 
of the appropriateness of the items to the element of 
the definition and discriminating ability. Thirty- 
six items were selected for the self-acceptance scale, 
28 for the acceptance of others scale. 

Each scale of the test was divided into two 
matched halves. Items of both scales were then 
mixed together on each half in a random manner. 


Reliability 


Matched-half reliabilities were computed for the 
groups named in Table 1 except for the two small 
groups of speech rehabilitation cases and counselees. 
The Spearman-Brown formula was then used to 
estimate whole-test reliability. These estimates of 
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whole-test reliability were all .894 or greater for the 
self-acceptance scale, except-for one group for Which 
it was .746. For the acceptance of others scaie the 
estimated whole-test reliabilities ranged from .776 
to .884. 


Validity 


One approach to the validity of the scales con- 
sisted in having one group of Ss (N=20) write 
freely about their attitudes toward themselves, and 
a second group (N=20) write freely about their 
attitudes toward others. Both groups used the ele- 
ments of the definitions as a guide. No names 
were recorded, but Ss were instructed to write the 
same five-digit number on both the essay book and 
the answer sheet which they had previously used 
to respond to the scales. The paragraphs were then 
rated by four judges and the mean ratings for each 
individual were correlated with his scores oa the 
corresponding scale. The Pearson product-moment 
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would score lower on this variable. They did. the 
difference being significant at about the .o2 level. 
Empirical evidence from other studies also suggested 
that prisoners could be expected to score lower on 
self-acceptance as well. The difference between the 
groups on this variable was significant at better than 
the .or level. 

The small group of speech problem cases and the 
three counselees scored very low on self-acceptance 
when compared with college students of the same 
age, sex, and race. This difference, too, was in the 
direction expected on an a priori basis. 

The mean scores and standard deviations for the 
various groups are shown in Table 1. 

A third kind of data obtained tn order to get 
evidence of validity was that of a comparison of the 
average ranking of the members of the speech re- 
habilitation group (N=7) by clinical assistants, and 
the scores of this group on the scale. This was 
done for self-acceptance only, the assistants using 
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MEANS AND STANDARD DeVIATIONS FOR THE Various Groups 








SeLr-ACCEPTANCE 


ACCEPTANCE OF OTHERS 





Group 





MEAN SD MEAN 





Day-Session College Students 
Evening-Session College Students 
Prisoners 

Stutterers 

Speech Problems at Univer. of Pittsburgh 
Adult Class at Y.M.C.A. 

Counselees 


135-50 - 36 105.15 
142.63 -43 106.39 
128.45 -09 101.30 
141.36 -70 111.45 
116.43 -30 98.00 
128.77 -57 112.38 
102.00 -71 100.33 





correlation between scores and ratings was .897 for 
self-acceptance, and .727 for acceptance of others. 
Both of these correlations were significantly greater 
than zero. These measures were considered to be 
one means of testing whether or not the scales 
actually measured the conscious, expressed attitudes 
of the individuals. 

The average of the intercorrelations among judges’ 
ratings was .869 for self-acceptance, .769 for accept- 
ance of others. 

The second approach to validity involved com- 
parisons between different groups. The group of 
stutterers was expected on both an a priori and an 
empirical basis to score lower on self-acceptance. 
Although their average score was higher than that 
of the large group of students, when they were 
matched for age and sex with a group of non- 
stutterers, their mean was lower, the obtained ¢ 
being 1.97. This was .06 lower than that required 
for the .o5 level of significance. This suggests that 
age may be positively correlated with expressed self- 
acceptance. 

A comparison was made of the mean scores on 
acceptance of others by prisoners and by a group 
of college students matched for age, sex, and race. 
A priori considerations and also some empirical evi- 
dence had led to the expectation that prisoners 


the definitions as a guide for their rankings. A 
rho of .59 was obtained. This was not significantly 
higher than might have been obtained by chance. 
This test, then, did not support the other evidence 
of validity. 


REsuLtTs 


For the purposes of this investigation, an 
individual's acceptance of self was represented 
by his score on the scale for self-acceptance 
and his acceptance of others was represented 
by his score on the scale for acceptance of 
others. 

The Ss included 183 day-session college stu- 
dents, 33 evening-session college students, 33 
prisoners, 38 stutterers, 18 adults in a class at 
the Y.M.C.A., 7 speech problem cases in a 
rehabilitation program, and 3 counselees. 

The procedure was simply to have the 
various Ss respond to the two scales. Data on 
the prisoners were obtained by the prison 
psychologist. Data on the stutterers at Iowa 
and Purdue were gathered by clinical as- 
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sistants there with the permission of the clin- 
ical director. All other data were obtained 
by the writer. 

The Pearson product-moment correlations 
between expressed acceptance of self and ex- 
pressed acceptance of others is shown for the 
various groups in Table 2. 

All of the corrclations are significantly 
greater than zero at better than the .or level 
of confidence (p= .006 or less), with one ex- 
ception. That for the “Y” group just misses 
significance at the .05 level of confidence 


These results permit us to hold with greater 


confidence the generalization that expressed 
acceptance of self is positively correlated with 
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and this serves to support and strengthen the 
theoretical considerations of Fromm (2), 
Horney (3), et al. to the effect that acceptance 
of self is positively related to acceptance of 
others. 

A consideration of factors which might 
lower the correlations seems to be in order. 
For one thing, the variables measured are 
“expressed” attitudes toward self and others. 
Thus they are subject to such distortions as 
we know take place in individuals with re- 
spect to both their perceptions of themselves 
and their perceptions of their own acceptance 
of others. 

Also, it appears that as a group, girls tend 
to score higher on acceptance of others for 


TABLE 2 


CorrELATIONS BETWEEN ExpresseED ACCEPTANCE OF SELF AND ExprEssED ACCEPTANCE OF (.>=RS 
FOR THE Various Groups 








Group N 





Day Students 183 
Evening Students 33 
Prisoners 33 
Stutterers 38 
Y.M.C.A. Class 18 





* x=self-acceptance; y=-acceptance of others. 


expressed acceptance of others, thus answer- 
ing the question with which this research 
began. 

What can be said of the results as a con- 
tribution to psychological theory? Correla- 
tions cannot tell us about causation, but on 
the basis of theoretical considerations dis- 
cussed at the beginning of this paper, self- 
acceptance can logically be assumed to be 
the more basic variable in the relationship. 
With correlations for the different groups 
ranging from .36 to .69, residual variances 
(1—7*) would range from 52 to 87 per cent. 
In other words, variation in expressed self- 
acceptance accounts for a relatively small por- 
tion of the variation in expressed acceptance 
of others. It follows from this that the rela- 
tionship is not so invariable or of such a 
high degree that individual predictions can 
be made with accuracy, and we do not have a 
“law” in this sense. 

The results do, however, indicate a good 
degree of association between the variables 


a given score on self-acceptance. Probably 
this has a cultural basis, but in any event 
it seems likely that the correlations would 
have been higher if the sexes had been sep- 
arated and the sex differences taken into 
account. 


SUMMARY 


This study was undertaken to determine 
whether or not the evidence for a positive re- 
lationship between acceptance of self and ac- 
ceptance of others would be strengthened by 
an approach using larger groups and more 
varied samples than had previously been 
studied. In order to do this, scales were 
constructed to measure expressed acceptance 
of self and expressed acceptance of others. 
The scales were found to have satisfactory 
matched-half reliability and considerable evi- 
dence was found in favor of the scales’ valid- 
ity. The scales were administered to a variety 
of groups and correlations between accept- 
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ance of self and acceptance of others were 
determined. These were all very significantly 
positive, with one exception (p= .06). It 
was concluded that evidence for a positive 
correlation between acceptance of self and 
acceptance of others was definitely supported 
and strengthened by the results of this study. 
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CONCEPTIONS OF PARENTS HELD BY ADOLESCENTS’ 


THOMAS F. JOHNSON 


Purdue University 


identify and measure the concepts held 

by adolescents concerning family fig- 
ures. The approach to this problem was made 
by comparing the concepts, as expressed both 
positively and negatively, of adolescents who 
were separated from their families with those 
expressed by adolescents living with their 
families. The sentence completion technique 
was used as a measuring device. 

Psychology has been concerned with the 
development of the self at least as long as 
there has been interest in the individual and 
his behavior. As more emphasis has been 


T= primary purpose of this study was to 


placed on the study of the individual, there 
has been growing interest in how the indi- 
vidual sees himself in relation to those about 
him. The fact that the manner in which 
one sees himself or feels about himself is 
related to how he feels about others is no 
longer solely a matter of belief, hope, or con- 


jecture, but has begun to find some objective 
support and amplification (6, 8). Support for 
this view has become so well established that 
one author could say, “Obviously the ap- 
praisal of personality is based primarily upon 
behavior and in the role and status a person 
occupies in a group or society” (1, p. 3). The 
same author goes on to point out the inter- 
action between the individual and his en- 
vironment in the matter of role taking and 
role assigning, stating that this interaction 
process is equally obvious. 

The assumptions upon which this study 
is based refer to this interdependent rela- 
tionship between the individual's concept of 
himself and his concept of those individuals 
who people his world. Hence an adolescent, 


1A modification of a thesis submitted to the Depart- 
ment of Education and Applied Psychology of Purdue 
University in partial fulfillment of the requirements for 
the MS degree in Psychology. The writer wishes to 
express his gratitude to Professor L. M. Baker, under 
whose direction this investigation was conducted, for 
his support and helpful criticism in the writing of this 
paper. Acknowledgment is also made of the helpful 
criticism offered by Professors J. M. Hadley and F. J. 
Shaw. 
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if he has already come to a conception of self, 
may be viewed as having arrived at his con- 
ception through such life experiences as af- 
forded him by the attitudes and behavior of 
his parents and associates toward him, and 
through his interpretation of these attitudes 
and behaviors. Interaction with individuals 
in the environment is continually contrib- 
uting to modification of the self-conception 
and the conception of others, while at the 
same time interaction is modified by these 
conceptions. 

It is a further assumption that on the 
whole the relationships within the family 
constellation are the most vital for any one 
individual, and that this is particularly true 
of persons in the period generally known as 
adolescence. Thus family relationships read- 
ily attract our attention and perhaps provide 
the most obvious problems and, it is hoped, 
the most fruitful results for research in this 
area. 

Beginning with the above assumptions one 
might expect that the introduction of an 
unusual circumstance into the family rela- 
tionship, especially if it included forcible 
separation from the parental figures, would 
have a significant effect upon the concep- 
tion of parental figures and in a way that 
was consistently different from the concep- 
tions held by individuals who had not ex- 
perienced such a separation. 

In addition, those adolescents belonging to 
a group whose members have been forcibly 
separated from the parental figures and sent 
to a training school because of delinquent 
behavior were believed to have had behavior 
problems stemming from their relationship 
with authority of one sort or another. Difh- 
culties with parents over matters of disci- 
pline are probably always a part of these 
problems. 

If the foregoing outline is tenable, one 
might expect not only a difference in the 
conceptions about parents held by members 
of a group committed to an institution be- 
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cause of delinquent behavior and those held 
by members of a group living at home with 
both parents, but also that there might be 
a consistent difference between the two 
groups in a particular direction. 

For those in the delinquent group it was 
held to be likely that there was anger felt 
toward the parents stemming out of the dis- 
ciplinary problems. The unique relationship 
with the parents, at the same time, would 
probably make for the parents being con- 
ceived of as the most friendly, or the least 
unfriendly, important persons in the adoles- 
cent’s world. The enforced separation and 
such other factors as dependence upon adults 
who, at best, cannot supply the feeling of 
intimacy and support that one may expect of 
one’s own family may well operate to 
heighten this conception of the parents so 
that they are conceived of as being even 
friendlier persons than would be the case if 
the adolescent were in his own home. Fur- 
thermore, the lack of day-to-day contact 
would eliminate all opportunity for friction. 
The institutional staff become the persons 
who control, deny, or punish, and it is with 
them that the opportunity for friction is great- 
est. The conception of the parents might 
benefit by this contrast. 


HyportuHeses 


If it is assumed, then, that concepts of oth- 
ers are determined by social interaction it 
would be expected that (a) other things being 
equal, adolescents who have been separated 
from their families will differ from adoles- 
cents who are living with their families in the 
concepts held about family. It is assumed 
that feelings expressed toward family may 
be used as an index to these differences. One 
purpose of this study was to investigate this 
hypothesis. If it is assumed that enforced 
separation from figures who might ordinarily 
be expected to furnish support and affection 
heightens the attraction of these figures, it 
would then be expected «!:at (4) other things 
being equal, adolescents who have been sepa- 
rated from their families will differ from 
adolescents who are living with their fam- 
ilies in that they will show a significantly 
more positive attitude toward family. The 
second purpose of this study was to test this 
prediction. 
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For purposes of this study a family was 
held to consist of a father and mother and 
at least one offspring. While the family 
might include several members, the minimum 
would be three, in which case the subject 
(S) was the only offspring. In each case 
both parents were living at home. 


METHOD 


A number of sentence completion tests were in- 
vestigated (2, 4, 5, 7) and were discarded as not 
entirely appropriate for this study. Certain items, 
believed to have possibilities for use in this study, 
were selected from among these tests, others were 
added by the writer. One hundred and three stim- 
ulus items were obtained in this way. The stimulus 
phrases were intended to elicit statements about 
fathers, mothers, parents as a class of people, and 
family as a unit. A category designed to elicit 
statements about children was included. 

These phrases were then given to children in 
public school and the results were examined to dis- 
cover which sentence-stems most frequently yielded 
responses believed to indicate an emotional reaction, 
whether positive or negative, to the subject of the 
stem. 

Three classes were tested, each with a different 
list drawn from one-third of the 103 items. The 
classes numbered 21, 24, and 22 children respectively. 
A frequency of 16 emotionally toned responses, as 
judged by the investigator, was arbitrarily decided 
upon as sufficient to retain an item in the final list. 
On this basis, 34 items were retained. 

The categories father, mother, parents, and fam- 
ily numbered six items each. The remaining 
sentence-stems served to dilute the emphasis upon 
family and modify cumulative effects. The form of 
the stimuli was also modified further by eliminating 
all proper names. This was done to avoid associa- 
tions with specific persons known to S, but not 
members of his family. 

The sentence-stems were arranged in such a way 
that every category appeared once every seven 
times, but never in the same position within the 
group of seven. The resulting order was hecto- 
graphed using three pages per test with 14 items 
per page. 


Experimental Group 

The selected list was given to 113 parent-separated 
adolescents at a school for delinquent boys and girls. 
Only those having both parents at home were in- 
cluded. These Ss were in the 6th, 7th, 8th, oth, and 
roth grades and represented an age range from 11 
to 18 years. 


Control Group 


Each S in the experimental group was matched 
for age, sex, grade, and socioeconomic level with 
an S who was living at home with both parents. 
Socioeconomic level was inferred from -the father’s 
occupation with the aid of tables published in a 
recent work on social classes in America (9). 
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Forty-two matched pairs were secured, including 
32 girls, ranging in age from 13 years, 8 months to 
17 years, 3 months. There were 52 boys, ranging 
in age from 13 years, 8 months to 17 years, 5 months. 
The Ss’ socioeconomic levels ranged from lower- 
lower class to upper-middle class (9). 


Analysis 


The completions for a given sentence-stem were 
grouped together without reference to other stems 
so that consistency of rating for each individual stem 
might be enhanced. 

Ratings were made by six judges, all graduate 
students in clinical psychology, who were asked 
to evaluate the feeling tone of each response; i.c., 
does the response indicate that S has positive feel- 
ings toward father, mother, parents, family, or chil- 
dren? Responses which were unclassifiable as posi- 
tive or negative because of failure to respond, am- 
biguity, or neutral statements were marked with a 
zero. Each judge was asked to check, in the appro- 
priate column, each of the responses listed. Agree- 
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therefore decided to keep the count of each S mak- 
ing a certain type of response rather than of the 
total number of responses. 

Assuming the hypothesis that the population mean 
difference between the two groups of matched 
pairs was equal to zero, Student’s ¢ test was em- 
ployed. The .05 level of confidence was fixed as 
the acceptable criterion beyond which one might 
reasonably assume that the differences obtained be- 
tween the two groups studied were due to factors 
other than chance. A one-tailed test of the corre- 
lated means was run with 41 degrees of freedom, 
since direction was predicted. 


REsULTs 


The ratings obtained from the five cate- 
gories judged produced differences between 
the adolescents who were separated from 
their parents and the adolescents who were 
living with their parents, as indicated in 
Table 1. Those adolescents who were sepa- 


TABLE 1 


Drirrerences BETWEEN SEPARATED AND NoNSEPARATED ADOLESCENTS IN Expresstnc Positive FEELINGS 
TOWARD Five VARIABLES 








NONSEPARATED 
ApS EsceNntTs 


SEPARATED 


IABLE 
Var ADOLESCENTS 


DIFFERENCE 


MEAN 
P 





Parents 106 74 
Family 116 04 
Father 89 65 
Mother 102 93 
Children 64 66 


-76 -262 
+52 -234 
+57 .040 
+21 -913 
+05 -266 


<.0o1 
<.01 
<.001 
not significant 
Not significant 





ment among four of the six judges wes required 
to establish the rating for a given item. 

In addition to the judges’ ratings, a count was 
made to ascertain the frequency with which both 
groups of Ss made overt statements about status; 
described parents, father, or mother as authorita- 
tive or nonauthoritative; attributed anger to par- 
ents or family; referred to father’s occupation or to 
mother’s occupation (including household duties) ; 
and how frequently the father or mother was re- 
ferred to as being ill. 

The role of the mother was further studied to 
determine the frequency with which her role was 
stated as being that of teacher of manners or morals, 
and as a source of truth and right. Also, the roles 
of teacher of manners and morals and source of 
truth and right were combined on the assumption 
that there was possibly overlapping in these two 
roles and that both could be subsumed under the 
role of social guide. 

The proportions obtained on these variables refer 
to the proportions of Ss in each group overtly indi- 
cating attitudes about the above-named feelings and 
roles. Individual differences accounted for wide 
variability in the number of times a given S would 
express a certain attitude or interest, and, thus, one 
or two Ss could unduly influence the count. It was 


rated from their parents were judged to have 
expressed more positive feelings toward par- 
ents, family, and father more frequently than 
did those adolescents who were living with 
their parents. 

In the case of attitudes toward or feelings 
about parents, Ss who were separated from 
their parents expressed positive feelings 106 
times, compared with 74 expressions of those 
who were living with their parents. Sig- 
nificance was beyond the .or level of confi- 
dence. 

The Ss who were separated from their 
families likewise expressed positive feelings 
toward family 116 times, compared with 94 
expressions of those who were living with 
their families. Significance was, again, be- 
yond the .or level of confidence. 

These same Ss expressed positive feelings 
toward father 89 times, compared with 65 
expressions of those who were living with 
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their parents. Significance in this instance 
was beyond the .oo1 level of confidence. 

No significant differences were found be- 
tween the two groups in their expression of 
negative feeling toward parents, family, 
father, mother, or children. 

While it is recognized that proportions ob- 
tained from samples as small as .those em- 
ployed in the two groups studied must be 
viewed with caution, the findings are of in- 
terest, if only as suggestive of further research. 
Keeping in mind that the figures given are 
the proportion of subjects giving a type of 
response, the findings are presented in Table 
2. 


TABLE 2 
ProporTions oF REsPoNsEs JUDGED AUTHORITATIVE 
oR NONAUTHORITATIVE IN THE CATEGORIES 
Parents, FATHER AND MorTHER *® 








NoNAUTHORI- 
TATIVE 


AvuTnHori- 
TATIVE 


VARIABLE DIFFERENCE 





Parents .48 -71 +23 
-62 -64 +02 
Father -14 -83 -69 
.2i +76 55 
Mother -45 74 -29 
-5S 71 -16 





® Nonseparated (control) group data appear in italics. 


Discussion 


It has been demonstrated that the adoles- 
cents who were in the experimental group 
and separated from their families consistently 
expressed more positive feeling toward family 
than did the adolescents in the control group 
who were living with their parents. 

In view of the high degree of statistical 
reliability obtained in the analysis of positive 
ratings on the variables parents, family, and 
father, Hypothesis II cannot be rejected. 

The hypothesis that, other things being 
equal, adolescents who have been separated 
from their families will differ from adoles- 
cents who are living with their families in 
the concepts held about family was supported. 

This hypothesis was supported in three of 
the five categories considered. Separation of 
an adolescent from family life can affect him 
to such an extent that he will be motivated 
to express attitudes which differ from, those 
of the adolescent living within his family. 
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It is important to note that the difference 
is one of degree and not one of kind. 

It was further hypothesized that, other 
things being equal, adolescents who have 
been separated from th-ir families will differ 
from adolescents who are living with their 
families in that they will show a significantly 
more positive attitude toward family. 

This hypothesis was also supported in three 
of the five categories considered. Separation 
of an adolescent from his family may have 
the effect not only of intensifying attitudes 
toward the family, but also of sensitizing re- 
sponses in a given direction, namely, in a 
positive direction. 

The case records of the delinquents studied 
(experimental group) provided a record of a 
long list of abuses, neglect, and rejection by 
the parents. The members of the separated 
group did not withhold expressions of their 
sense of injury. Their protocols showed that 
they would like to see changes for the better 
in their relationships with parents as well as 
in the behavior of the parents, and yet, it was 
this same group that expressed most positive 
feeling. 

Both the experimental and the control 
groups gave frequent expression to negative 
feelings in the form of complaints, criticisms, 
or fears, and there were no statistically sig- 
nificant differences between the groups in this 
regard on any of the five variables tested. 
This suggests that the condition of separa- 
tion from the family for reasons of delin- 
quency, while not reducing or obscuring nega- 
tive feelings about the family, stimulated the 
delinquent who was in an institution to 
express more positive feeling toward family 
than did the condition of the nondelinquent 
who was living at home. 

At least four possible motives may be sug- 
gested to account for the group differences: 
(a) wishful thinking generated by feelings 
of rejection and serving to deny the rejection 
by an unconscious mechanism; (4) the effect 
of deprivation serving to intensify need; (c) 
guilt generated by feelings of hostility toward 
the parents, the emphasis on positive feeling 
serving as an unconscious denial of the hos- 
tility; (d@) a desire to indicate that the delin- 
quent has changed for the better in regard 
to parental authority and is ready to be sent 
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home. The latter might represent a con- 
scious attempt to influence authority.’ 

The manner in which the stimulus phrases 
were selected leaves the question of item re- 
liability open to serious consideration. This 
is particularly true for the categories mother 
and children. 


Additional Findings 


The findings obtained by means of a count 
of overt responses to such facets as socioeco- 
nomic status, anger of parents or family, 
authoritative or nonauthoritative parents, 
father or mother, reference to illness of 
father or mother, and the roles of the mother 
are only suggestive of further areas for study, 
since the N’s involved are too small to war- 
rant much statistical treatment, or to allow 
any conclusions to be drawn from the data. 
They are reported here, however, because they 
are suggestive (cf. Tables 2, 3, and 4). 

While differences between the groups in 
depicting parents, father, or mother as au- 
thoritative or nonauthoritative were relatively 
small, when within-group differences are con- 
sidered, the findings again become of interest 
(cf. Table 2). 

Those adolescents who were separated from 
their parents indicate that parents are more 
nonauthoritative, while those living with 
their families show little change in this re- 
spect: In other words, it would appear that 
it is more important to adolescents separated 
from their families to attribute nonauthori- 
tativeness to their parents and to minimize 
authoritativeness than it is to adolescents who 
are living with their families. Hence, at a 
time when adolescents are commonly held to 
be in rebellion against parental authority, the 
former group seems to deny that the author- 
ity is harsh, stern, denying, while the latter 
group is more willing to depict parental au- 
thority as harsh, stern, denying. 

When, however, we come to examine the 
case of the particular parent, i.e. father or 
mother, this difference does not appear. Both 
groups indicate that the father is a nonau- 
thoritative figure. Both also indicate that the 


2In a recent paper by Julian Meltzoff (3) the condi- 
tions under which a sentence completion test is adminis- 
tered were shown to have an effect upon the results 
obtained. There was evidence that the S was able to 
censor his responses. Rotter and Willerman (4) have 
also pointed out this fact. 


mother is frequently authoritative, although 
still generally nonauthoritative. Whether this 
is due to the mother’s having more respon- 
sibility for the rearing of the children, or due 
to the unreliability of the test, can only be 
speculated upon here, although the data on 
the maternal role seem corroborative. 

In the case of the variable occupation both 
groups made frequent reference to the father’s 
occupation either by name, description, or 
location, suggesting that the father’s role 
within the family is principally that of pro- 
vider. On the other hand, approximately 
half of both groups made frequent reference 
to the mother’s occupation, largely referring 
to household duties. It was for this reason 
that only the mother’s role was further stud- 
ied (cf. Table 3). 


TABLE 3 
ProporTIONs OF SEPARATED AND NOoNSEPARATED 
Groups REFERRING To ILLNESS AND OccUPATION 
oF FATHER AND MoTHER * 








VARIABLE FATHER MoTHER DIFFERENCE 





Occupation -90 .48 -42 
-93 -62 jt 
Illness 87 -12 -05 
~” -29 —.07 





* Nonseparated (control) group data appear in italics. 


The variable illness yielded only slight dif- 
ferences between groups when the father was 
considered. Over twice as many of the non- 
separated group made reference to mother’s 
illnesses as did the separated group (cf. Table 
3) 


In outlining the roles of the mother, the 
two groups showed little difference in the 
roles assigned. The major role assigned the 
mother was that of spiritual, moral, and be- 


havioral guide and leader. Next in order, 
and of nearly equal frequency, were the role 
of mother as a source of affection and her 
role as housekeeper (cf. Table 4). 

Where the auxiliary data reveal differences, 
the findings support the conclusion drawn 
from the major findings. 

The auxiliary data further suggest that ad- 
olescents may conceive of the role of the 
father as principally that of provider, while 
that of the mother is that of homemaker 
responsible for the care and training of her 
children. 


ete aang ee 


AOR POON Nee RENE AT OES CE 








Tuomas F, Jonnson 


TABLE 4 
Tue Roves or THE MoTHER 








Pro- 
PORTIONAL 
SEPARATED DIFFERENCE 


Non- 


Roxie ASSIGNED SEPARATED 


. Teacher of Morals 

and Manners -24 +2 -03 
. Source of 

Truth and Right .38 ‘ -05 

3.1and2combined . ° -02 

. Source of 

Affection 
. Housekeeper 
. Teacher of Skills 








It is believed that we may accept the find- 
ings in regard to the two groups studied with 
some degree of confidence. They are in 
agreement with, and furnish some statistical 
support to, the widely held impression that 
adolescents who are separated from their par- 
ents express more positive feelings for family 
than do those living with their parents. The 
findings do not, however, justify the con- 
clusion that this impression has been estab- 
lished as fact. The methodology used might 
prove to be very fruitful for studying such 
problems as the one attempted here. 


SUMMARY AND CONCLUSIONS 


The primary purpose of this study was 
to identify and measure the concepts held by 
adolescents concerning family figures. 

A projective test of the sentence completion 
type designed to elicit feelings and attitudes 
toward family, and in particular toward par- 
ents as the core of the family, was adminis- 
tered to 113 adolescents at a school for delin- 
quent boys and girls and to 111 public school 
students. Forty-two pairs, matched for age, 
sex, and socioeconomic status, were obtained. 
The responses were judged by student psy- 
chologists to be positive (suggestive of posi- 
tive feelings for parents and family), nega- 
tive (suggestive of negative feelings for par- 
ents and family), or neutral (evasive or 
ambivalent in their responses). 

Responses were tallied to obtain the fre- 
quencies with which overt responses were 
made regarding socioeconoruic status, anger 
of parents or family, authoritativeness of par- 
ents, vocation of parents, and illness of par- 
ents. In the same fashion the role of the 
mother was analyzed. The proportion of Ss 


in each group producing overt responses to 
these variables was obtained. 

The differences between the two groups 
revealed a greater expression of positive feel- 
ing by those adolescents who were ‘living at 
a school for delinquents. The differences 
were significant beyond the .o1 level of con- 
fidence for the expression of positive atti- 
tudes toward parents and toward family, and 
beyond the .oor level of confidence for the 
expression of positive attitudes toward the 
father (cf. Table 1). 

Proportional differences between groups 
consistently indicated a higher proportion of 
statements depicting parents, father, and 
mother as nonauthoritative were made by 
the experimental group (cf. Table 2). 

The experimental group (separated chil- 
dren) made less frequent reference to par- 
ents’ occupations, especially to the mother’s 
occupation. 

The control group (nonseparated children) 
made more frequent reference to mother’s 
illness (cf. Table 3). 

Both groups indicated closely similar views 
concerning the role of the mother (cf. Table 
4). The role of the mother as the principal 
source of affection was indicated by the pro- 
portions of .29 of the control group and .36 
of the experimental group. Thirty-eight per 
cent of the control group and 43 per cent of 
the experimental group indicated that they 
could expect the truth from their mother, or 
that what she did would be right. The mother 
was considered to be responsible for teach- 
ing manners and morality by .24 of the con- 
trol group and .29 of the experimental group. 
The mother as a teacher of skills was indi- 
cated by only .05 of the control group and .o7 
of the experimental group. 

It appears that for approximately half of 
the members of each group, “mother knows 
best” and is responsible for imparting this 
knowledge to her children. 

It also seems that important roles for the 
mother are those of source of affection and 
of housekeeper. 

The similar views of the two groups con- 
cerning the roles of the mother suggested that 
the cultural attitude was being reflected in 
their responses. 

The experimental group consistently em- 
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phasized the nonauthoritativeness of parents 
both as a unit and as individuals to a greater 
extent than did the control group. 

Both groups made statements about the 
father’s occupation to a greater degree than 
about the mother’s occupation, although the 
control group made more frequent statements 
about mother’s occupation than did the ex- 
perimental group. 

The control group made more frequent 
reference to the illness of both parents than 
did the experimental group. The difference 
was particuarly great in the case of the 
mother. 

The sentence completion test used was 
sensitive enough to reveal differences between 
the experimental (separated) and control 
(nonseparated) groups of children with re- 
spect to family as a unit; parents as the basic 
unit of family; and father qua father. Fur- 
ther use of this technique appears to be prom- 
ising when studying concepts concerning 
family relationships. 
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His is one of a series of investigations on 
the role of unconscious attitudes in in- 
terpersonal relations. The present 

paper is a sequel to two previous papers which 
dealt with transference and countertransfer- 
ence reactions in psychotherapy (4, 5). In 
the present study we investigated the effect 
of similar unconscious attitudes in social rela- 
tions. These postulated unconscious attitudes 
were measured by examining one person’s 
perception of another. In particular, we com- 
pared a person’s perceptions of those whom 
he likes with his perceptions of others whom 
he dislikes. 

Many investigators have shown that per- 
ceptions of ambiguous stimuli primarily re- 
flect the needs and attitudes of the perceiver 
(see 1). More recently, Bruner and Good- 
man (2), McClelland and Atkinson (8), San- 
ford (11), and others have used well-struc- 
tured stimuli which are presumed to have 
some affect attached. These investigations 
have demonstrated that even these stimuli 
are in systematically distorted 
fashion. 

The ultimate aim of most of the studies 
on idiosyncratic perception has been the in- 
vestigation of personal adjustment or of per- 
sonality organization in general. When we 
speak of personality organization or adjust- 
ment, we generally imply a person’s adjust- 
ment to others, or in relation to other persons. 


perceived 


1 This research was carried out in part under con- 
tract N6ori-27135 between the University of Illinois 
and the Office of Naval Research, U. S. Navy (Tech- 
nical Report No. 1). 

2 Fiedler and Blaisdell shared responsibility for the 
design of this research; data were collected by Blaisdell 
who also the preliminary statistical analyses. 
Warrington responsible for the final statistical 
unalysis of the data and shared with Fiedler in the 
interpretation and final write-up of this study. 

This paper was read in part at the Midwest. Psychol. 
Ass., Chicago, 1951. 

We wish to express our appreciation to Dr. L. J. Cron- 
bach and Mr. W. Hartmann for their suggestions and 
critical readings of the manuscript. 


made 
was 


BLAISDELL 


We are not particularly concerned about an 
individual’s adjustment in relation to inani- 
mate objects, pictures, or symbols. Rather, 
we are concerned about his behavior toward 
other persons, and the needs and attitudes 
which he manifests in relation to them. His 
behavior will be determined in large measure 
by how he perceives them. To study a per- 
son’s perception by using other people as 
stimuli seems, therefore, an important exten- 
sion of the investigations which utilize sym- 
bols or inanimate objects as stimuli. 

One important dimension which has been 
found in studies of therapeutic relations by 
Fiedler (4) and Fiedler and Senior (5) is the 
tendency to assume another person to be 
similar to or different from oneself. We have, 
therefore, considered it valuable to investi- 
gate this measure further in a social rela- 
tionship. 

Overestimation of similarity has been in- 
terpreted in these previous studies of psycho- 
therapeutic relationships as an indication of 
psychological closeness, empathy, or liking. 
Underestimation of similarity was interpreted 
as perceived psychological distance, coldness, 
or disliking. We, therefore, hypothesize that 
subjects (Ss) will. perceive fellow group 
members they like best as more similar to 
themselves than those they like least. 

Another aspect of interpersonal perception 
which concerns us is the tendency to perceive 
another person as similar or dissimilar to 
one’s ideal self. This was interpreted as 
measuring the degree to which the perceiver 
sees the other person as being the sort of 
individual he, the perceiver, would ideally 
like to be (4). This tendency is thus inter- 
preted as measuring the degree to which one 
person idealizes another. The concept of the 
ideal person in our culture is usually that 
of a psychologically strong, self-sufficient per- 
son who is able to give emotional gratifica- 


go 
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tion (5). The perception of a person in this 
light probably implies, therefore, an expecta- 
tion of a role which this other person will 
actually fulfill, ic, that he will gratify the 
perceiver’s emotional demands. Perceiving 
another person as psychologically weak would 
analogously imply perceiving him as needing 
support and succor. Hence, our second hy- 
pothesis that Ss will perceive persons they 
like best as also more similar to their ideal 
self than persons whom they like least. 

Thus far we have been primarily concerned 
with the way in which one person’s atti- 
tude and his perception of another person are 
related. It is quite likely, however, that per- 
sons like others because they are actually sim- 
ilar in the way in which they see themselves, 
or in the way in which they would like to be. 
In general, this follows the popular notion 
that people who feel the same way about 
themselves will like each other. We shall, 
therefore, test whether Ss are, in fact, more 
similar to the persons they like than to per- 
sons they dislike. This similarity will be 


measured in terms of self, and ideal-self per- 
ceptions. 


Our third hypothesis is that Ss will be more 


similar in self (or ideal-self) perceptions to 
those they like than to those they dislike. 

We are studying members of a social group 
from whom a short sociometric questionnaire 
was obtained. Each S was asked to name 
the person in the group he liked best and the 
one he liked least. We can, therefore, further 
inquire whether the more popular members 
of the group differ in their perceptions of 
others from the relatively less liked group 
members. Sociometric status is, in large part, 
an indication of the attitudes which a person 
evokes in others. Hence, it is probably re- 
lated to the personality organization of an 
individual. We should expect, therefore, that 
popular individuals, who presumably have 
better interpersonal relationships with their 
peers, will perceive others differently from 
persons who have poor relationships with 
their peers. Hence, our fourth hypothesis 
that Ss who are liked by others in their group 
will perceive their liked and disliked fellow 
group members in a different manner from Ss 
who are, themselves, relatively disliked. 

Finally, this study investigates two vari- 
ables in interpersonal perception which ap- 
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pear to tap important aspects of the thera- 
peutic relationship. For example, psychother- 
apists rated high for professional skill have 
tended to assume more similarity between 
themselves and their patients than therapists 
rated as relatively poor (4). Another investi- 
gation (5) suggests that patients tend to ideal- 
ize good therapists to a greater extent than 
they idealize relatively poorer therapists. Since 
the perception measures in the studies of 
therapeutic relations are identical to those in 
the present study on social relations, we can 
compare the social and the therapeutic rela- 
ships in terms of these two measures. 

The aim of this paper is thus fivefold, to 
test the hypotheses that 

I. Ss will perceive those they like as more 
similar to themselves than those they dislike. 

II. Ss will perceive those they like as more 
similar to their ideal than those they dislike. 

III. Ss are (in fact) more similar in self 
and ideal-self descriptions to those they like 
than to those they dislike. 

IV. Popular Ss will perceive others they 
like and dislike in a different manner from 
unpopular Ss. 

V. We shall compare therapeutic and social 
relationships in terms of the interpersonal per- 
ception measures with which this study deals. 


METHOD 
Subjects 


Twenty-six students who were members of one 
college fraternity participated in this study.4 The 
total group consisted of 27 full members and pledges. 
The 27th person, one of the authors (Blaisdell), 
was not tested. All Ss had considerable contact with 
each other and most Ss ate and roomed at the fra- 
ternity house. At the time of this study, all Ss 
had known each other rather well for at least three 
months. 


Instrument 


The test consisted of 76 statements descriptive 
of personality traits. These were selected from 
Murray’s list of personality descriptions (9, pp. 142- 
242). Two relatively short and unambiguously 
worded statements were chosen from each need 
category and mounted each on a separate index 
card.4 Statements such as, “I go my own way 
regardless of the opinions of others,” “I feel nervous 


8 We wish to express our sincere appreciation to the 
officers and members of Phi Kappa Psi fraternity at the 
University of Chicago for their wholehearted cooperation. 

*For list of statements order Document 3433 from 
American Documentation Institute, 1719 N_ Street 
N.W., Washington 6, D. C., remitting $1.00 for 45- 
mm. microfilm or $1.00 for 6 by 8 in. photocopies. 
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and anxious ‘n the presence of superiors,” were 
included. Stephenson’s O technique (12) was used. 
Each S was asked to sort the statements into eight 
categories consisting of 1, 5, 12, 20, 20, 12, 5, and 
I statement each. He was instructed to place the 
most characteristic statements at one extreme of this 
forced normal distribution, and the least applicable 
statements at the other extreme. 

Each S§ sorted the statements four times. He was 
asked to (a) describe himself, and (6) describe how 
he would ideally like to be. He was also asked to 
sort the statements, (¢) predicting how his best-liked 
fellow group member would describe himself, and 
(d) how his least-liked fellow group member would 
describe himself. 


Interpersonal Perception Scores 


Conventional tests are scored by comparing Ss’ 
responses with a key consisting of predetermined 
“right” answers. The scores of our interpersonal 
perception tests are, however, obtained by compar- 
ing one set of responses with another set of responses 
by the same S. S's self-description is compared, for 
example, with S’s prediction of another's self-descrip- 
tion. This provides us with one measure of S’s per- 
ception of another person, using S’s self-description 
as a point of reference. 

The various interpersonal perception scores and 
the method of obtaining them are described below. 

{Sp. S's self-description correlated with S's pre- 
diction of his best-liked (BL) choice’s self-descrip- 
tion. The “assumed similarity” to the positive 
choice 

ASn, S's self-description correlated with S’s pre- 
diction of his least-liked (LL) choice’s self-descrip- 
tion., The “assumed similarity” to his negative 
ch ce 
ideal-self description correlated with S's 
assumed 


ASI p. S's 
prediction of his BL’s self-description. The 
similarity of BL to S’s ideal-self. 

{SIn, S's ideal-self description correlated with S's 
prediction of his LL’s self-description. The assumed 
similarity of LL to S’s ideal-self. 

We also attempted to “correct” the 
measures by the “real similarities” in self- 
descriptions. For this purpose, as well as to test 
Hypothesis III, we needed to compute the correla 
i and ideal-self descriptions 
of each S§ and his choices. These “real similarity” 
measures were obtained as follows: 

RSp. The real similarity between S's self-descrip- 
tion and his positive choice’s self-description. 

RSn. The real similarity between S's and his 
negative choice’s self-descriptions 

RSIp. The real similarity between S's ideal-self 
description and his positive choice’s self-description. 

RSIn. The real similarity between S’s ideal-self 
and his negative choice’s self-description. 

Id/Idp. The real similarity between S’s and his 
BL choice’s ideal-self decriptions 

Id/ldn. The real similarity between S’s and his 
LL choice’s ideal-self descriptions. 

In order to obtain the “corrected” measures, we 
had to ask whether S had perceived his positive or 


assumed 


1 
similarity 


between the self 


tions 


negative choice as more similar or as less similar 
to himself than the other person really is. We then 
obtained a measure which has been named “Un- 
warranted Assumed Similarity” (UAS) or “Unwar- 
ranted Assumed Similarity to S’s Ideal” (UASI) in 
a previous paper by Fiedler (4). 

UASp5> This measure is obtained by subtracting 
RSp? from ASp? and retaining the signs. 

UASn is obtained by analogous operation, ASn?— 
RSn?*. 

UASIp.> This measure is obtained by subtracting 
RSIp? from ASIp?. 

UASIn is obtained by the operation ASIn?— 
RSIn?, again retaining the signs. 


RESULTS 


Hypotheses I and Il 

Assumed similarity. The first two hypoth- 
eses tested stated that Ss would perceive per- 
sons whom they like best as more similar 
to themselves than those whom they like 
least, and that they would similarly perceive 
persons whom they like best as more similar 
to their ideal self than persons whom they 
like least. 

To test these hypotheses, we compared Ss’ 
interpersonal perceptions of persons they 
liked best with their perceptions of persons 
whom they liked least. The significance of 
the differences was tested by Wilcoxon’s non- 
parametric test for ranked data from matched 
pairs (14). This test makes no assumptions 
concerning the population distributions, and 
takes into account the direction as well as 
the amount of the differences between paired 
variates. 

The results of this test are presented in 
Table 1. As can be seen, the variables AS, 
ASI, and UAS are significantly different at 
the .o1 level of confidence, and UAST is sig- 
nificant at the .05 level. All these support 
the hypothesis. 

We interpret these results to mean that Ss 
perceive persons whom they like best as more 
similar to themselves than others whom they 
like less, and also as more similar to their 
ideal. This hypothesis has received further 
confirmation in two subsequent studies (6) 
and may now be considered a fairly well es- 
tablished fact. 

5 While the mathematical properties of these measures 
are not yet completely understood, they have yielded 
interesting results in previous studies. There is con- 
siderable reason to doubt that this index is based on an 
adequate rationale. Other ways of studying the AS-RS 


patterns, which are mathematically more satisfactory, are 
required. 
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TABLE 1 
Susyects’ Perceptions oF Tuer Best- AND THeEmR Legast-LikeD CHOICES 
(N=26 Choosers) 


PERCEPTION OF 
Best-Likep 
CHOICE 
(Median Q correlation) 








PERCEPTION OF 
Least-Likep CR FROM 
CHOICE z* 


(Median Q correlation) 








+240 
-425 


-054 
082 


025 -460 
-120 +07 
—.o10 85 
.000 25 





test of significance used 


Hypothesis Ill 

Real similarity. This hypothesis stated that 
Ss would be more similar in self (or ideal- 
self) descriptions to those they like best than 
to persons whom they like least. Using the 
Wilcoxon ¢ test (14), we find no significant 
differences in actual similarities between the 
self-descriptions of Ss and persons they like 
and the self-descriptions of these Ss and the 
persons whom they dislike (Table 2). 

On the basis of these results, we cannot, 
therefore, say that persons tend to prefer oth- 
ers who are more similar rather than those 
who are less similar to themselves. Nor can 
we say that persons with relatively more sim- 
ilar ego ideals tend to be preferred over those 
whose ego ideals are more different. It must 
be remembered, however, that “real similar- 
ity” refers logically to a specific domain of 
attributes (3). Our statements probably do 
not adequately sample all relevant areas of 
personality which might affect one person’s 
preference for another. It is, therefore, not 
unreasonable to expect that other statements, 
or other methods, might demonstrate a rela- 
tionship between similarity on certain traits 
and likings. There is some reason to think 


here was Wilcoxon’s Matched Pairs Signed Rank Test (14). 


that present methods yield unreliable indices 
of “real similarity” (10). 


Hypothesis IV 

Popular vs. unpopular perceivers. This hy- 
pothesis stated that Ss who were liked by 
others would perceive persons whom they 
like and dislike in a different manner from 
Ss who were disliked. 

In order to test this hypothesis we divided 
our 26 Ss into a relatively popular and a 
relatively unpopular group on the basis of the 
number of positive or negative votes the Ss 
had received. A weighted ranking procedure, 
following the method proposed by Seeley 
(12), was adopted. Four Ss who received 
neither positive nor negative votes were 
dropped for the purpose of this analysis. 
In our analysis we thus compared two groups 
of 11 Ss each. One of these groups consisted 
of persons who had been more frequently 
chosen than rejected, the other group con- 
sisted of Ss who had been more frequently 
rejected than chosen. The two groups were 
compared on each of the four variables, AS, 
ASI, UAS, and UASI, for both their liked 
and their disliked choices. A total of eight 
significance tests was thus made. 


TABLE 2 


Rear SrmiLaritry Scores oF CHoosers AND TuHetr Best-Likep AND CHoosEeRs AND THEIR Least-Likep CHolces 


Rear SImMILariry 
oF CHOOSER AND 
VARIABLE His Best Likep 


(Median Q correlation) 


(N = 26 Choosers) 


Rear SIMILaRiry 
OF CHOOSER AND 
His Least Lixep 


(Median Q correlation) 








RS -135 
RSI +245 
Id/Id -340 


+093 
-990 
+410 





* Test of significance here used was Wilcoxon’s 


Matched Pairs Signed Rank Test (14). 
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TABLE 3 


CoMPARISON OF INTERPERSONAL Perception Scores Mabe sy Popurar aNnp Less Popurar Susjects * 


PERCEPTION SCORE OF 
PopuLar SuBJECTS 
(Ni=11) 
(Median Q correlation) 


VARIABLE 


PERCEPTION SCORES OF 
Less Popucar SuBJECTs 
(Ne=11) 


CR FROM 
U *e 
(Median Q correlation) 





ASp 
ASIp 
UASp 
UASIp 
ASn 
ASIn 
UASn 
UASIn 


-160 


190 


-250 
- 430 
.058 
.140 
.o8o 
- 180 
«000 


-005 





the c 
test of 


were § 


*None of 


** The 


ymparisons 
used 


significance was 

As can be seen from Table 3, none of these 
comparisons between groups was significant 
even at the minimally accepted .05 level. The 
second hypothesis has, therefore, not been 
confirmed. Thus, we cannot say that persons 
with relatively better interpersonal relation- 
ships perceive others differently (in terms of 
our measures) from persons with relatively 
poorer interpersonal relationships. 

In considering these negative findings, a 
number of points must be kept in mind. 
(a) We are dealing with a fraternity group 
which selects its members on the basis of lik- 
ing; hence the group has a highly restricted 
range. (4) In contrast to statistics applicable 
for testing the first three hypotheses, we have 
a smaller sample, and we could not use the 
more sensitive matched-pairs statistic. (c) 
Our method may not be the most relevant for 
the study of idiosyncratic perception of oth- 
ers. (d@) If this finding should eventually be 
confirmed under more appropriate condi- 


tatistically significant 
the Mann-Whitney U test for two unmatched groups (7). 


tions, it may mean that personality differ- 
ences (here measured by popularity) are not 
reflected in interpersonal perception. A more 
general finding of this nature would have far- 
reaching implications on present personality 
theory and tests based on interpretation of 
idiosyncratic perception. In view of the many 
limitations of our method, such an interpre- 
tation of our data cannot be justified at this 
time. 

This research also gave us the opportunity 
to test whether the similarity of “popular” 
individuals to their choices is different from 
the similarity of rejected individuals to their 
choices. These results are presented in Table 


Our data do not indicate decided differ- 
ences between accepted and rejected individ- 


uals in terms of their self and _ ideal-self 
descriptions. We do find that the more pop- 
ular Ss tend to be more similar to their 
disliked choices than the unpopular Ss. In 


TABLE 4 


Rear Srmmarity Scores oF Poputar SuByEcTs AND oF Less Poputar SuByEcTs 


Reat 


OF 


SmrLariry Scores 

Poputar SuByJECTS 
(Ni=11) 

(Median oO c 


VARIABLE 


rrelation) 


Reat Srmrcarity Scores 
oF Less Poputar SuByEcTs 


CR From 


(N2=11) u* 


(Median Q correlation) 


< 





RSp 
RSIp 
Id/Idp 
RSn 
RSIn 
Id Idn 


-I1 
.It 
+25 
.18 
-12 


+35 


-07 
.26 
+33 
-Or 
+09 


.48 


MnuU oOo asa 
oo URW 





* The test of significance used was 


the Mann-Whitney U test for two unmatched groups (7) 
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view of the multiple significance tests, this 
significant difference must be interpreted 
with caution. 

Comparison of perceptual measures UAS 
and UASI in therapeutic and social relation- 
ships. This paper deals with the same per- 
ceptual measures, UAS and UASI, which had 
previously been studied in the therapeutic 
relationship. These measures are apparently 
associated with important dimensions of the 
patient-therapist interaction, as is indicated 
by their correlation with the therapist’s rated 
competence. We shall here compare the 
therapeutic with the social relationship in 
terms of these two variables. 

As can be seen from Table 5, previous 
studies indicated a positive correlation be- 
tween the therapist’s rated competence and 
his UAS, but not between his competence 
and his UASI. The patient’s UASI was 
found to be positively correlated with the 
therapist’s competence, but the patient’s UAS 
was found to be unrelated to the therapist's 
competence. These results were interpreted 
as indicating that good therapists see them- 
selves as psychologically close to the patient, 
but they do not idealize the patient, i.e., they 
do not make emotional demands on him. 
On the other hand, the patient of the good 
therapist does not see himself psycholog- 
ically close to his therapist, but he idealizes 
him, i.e., he makes emotional demands on 
his therapist. 

In the social relationship the liked person 
is perceived as psychologically close, and he 
is also idealized to a relatively greater extent 
than is the case with the disliked person. 


TABLE 5 


CoMPARISON OF THERAPEUTIC wiTH SoctaL RELAa- 
TIONSHIPS IN TERMS OF THE PERCEPTION 


PERCEPTION VARIABLES 


PERCEIVER UAS UASI 





Good therapist's perception 
of patient 

Poor therapist's perception 
of patient 

Perception of good therapist's 
patient 

Perception of poor therapist's 
patient 

S’s perception of liked other 

S’s perception of disliked other 


High 


Low 





795 


In the therapeutic relationship we obtained 
a rank-order correlation between UAS and 
UASI of .08 (4). This same correlation in 
our group of Ss was .59 for UASp and UASIp 
and .26 for UASn and UASIn. It thus ap- 
pears that liking and idealization of another 
go hand in hand in social relationships while 
the therapeutic relationship tends to be uni- 
lateral in this respect. In general, these find- 
ings are in agreement with current theory 
of psychotherapeutic relations. 


Discussion 


This study provides answers to a number 
of questions which are of importance to our 
understanding of interpersonal relations. Our 
data indicate that we idiosyncratically or dis- 
tortedly perceive persons whom we like to 
be more similar to ourselves than those whom 
we dislike. These results, therefore, imply 
that we may eventually be able to utilize this 
method for the purpose of assessing personal 
preference on the unconscious level or in 
cases where overt expression cannot satisfac- 
torily be obtained (i.e, liking or disliking for 
a superior, or disliking of a family member). 

It may be questioned whether our variables 
AS, ASI, UAS, and UASI are, at least in 
part, measures of unconscious atttiudes, that 
is, if these responses appear to tap attitudes 
of which the S is unaware. 

It is almost impossible to demonstrate that 
an individual has an unconscious attitude, 
but we can sometimes adduce the existence 
of such an attitude. In this case, we gave 
Sa task requiring him to predict how another 
person would describe himself. Most persons 
in our culture pride themselves on being good 
judges of human nature. Since Ss were also 
told that the prediction would be compared 
with the other person’s self-description, they 
tended to become readily ego-involved with 
this task. The S’s set was thus very definitely 
one of making a correct prediction. In fact, 
among persons questioned in connection with 
a previous study, very few could imagine 
any other purpose for our procedure. When 
they were asked to estimate the direction or 
degree of the similarity they had assumed 
to exist between themselves and the other 
person, none did so with any reasonable ac- 
curacy. Ss usually felt that they had neither 
over- nor underestimated the real similarity in 
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self-descriptions. It is also unlikely that an 
S would or could arrange the 78 statements 
so as to yield a predetermined correlation 
with another description he may have made, 
while at the same time trying to predict some- 
one else’s responses. In light of this evidence, 
we feel fairly certain that the assumed sim- 
ilarity measures are indices of unconscious 
attitudes in this study. 


SUMMARY AND CONCLUSIONS 


This study has investigated the role of un- 
conscious attitudes in social relationships of 
a’ fraternity group. These attitudes were 
measured by determining the systematic dis- 
tortion or bias in one S’s perception of an- 
other. 

Ss of this study were 26 fraternity men. 
Each person named his best-liked and his 
least-liked fellow group member. He also 
described himself and his ideal self, and pre- 
dicted the self-descriptions of his best-liked 
and least-liked fellow group member using 
Q technique methods. Perceptual distortion 
was measured in terms of the similarity (or 
unwarranted similarity) which S assumed be- 
tween himself and his best- and least-liked 
person. 

Ss were divided into two groups, those 
sociometrically chosen and those sociometric- 


Perceptions of chosen and re- 


ally rejected. 
jected groups were compared. 

Results indicate that Ss perceive fellow 
group members they like best as more sim- 
ilar to themselves than those they like least. 
We found, similarly, that Ss perceive fellow 
group members liked best as more similar 
to their ideal-self than those they liked least. 

It was hypothesized that Ss will actually 
be more similar in self (or ideal-self) descrip- 
tions to those they like than to those they dis- 
like. Our results do not support this hypoth- 
esis. 

Nor was the hypothesis supported that Ss 
who are liked will perceive their fellow group 
members in a different manner from that of 
relatively disliked Ss. 
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Finally, we have compared the therapeutic 
and social relationships. Our results support 
the current view that the therapeutc relation- 
ship is unilateral in character in contrast to 
the more mutual social relationship. 
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DYNAMIC AND COGNITIVE FACTORS CONTROLLING 
MISPERCEPTION 


R. B. CATTELL AND P. W. WENIG 
University of Illinois 


ELIMITATION of the number and nature 
of the factors determining mispercep- 
tion has for some time been an urgent 

need, both for the guidance of effective 
specialized “pure” research on perception and 
for the practical improvement of “projec- 
tive” test techniques. 

Let us first consider the assumptions of the 
latter, thereby leading to a survey of theoret- 
ical principles in the former. The essential 
feature of all tests of this kind is the com- 
parison of an individual’s perception of a 
situation either with the demonstrably “true” 
perception or with some population norm of 
response. Since this is the one certain and 
common operational measure in such tests it 
has seemed wise to us, in a series of contribu- 
tions in this field, to drop such interpretive 
and question-begging terms as projective and 
to use misperceptive (2, 4, 10, 15). Appercep- 


tive is also acceptable, but the term misper- 
ception or misperceptive is better than apper- 
ception, because it reminds us that we are 
dealing with measurement of a discrepancy 


between an individual reaction and some 
super-individual standard of reaction. 
When the organization of research on the 
misnamed “projective” tests is thus insight- 
fully centered upon their true common fea- 
ture—misperception—the problems become 
contiguous with those of the theory of per- 
ception and a basis is provided for more pre- 
cise and progressive research than has hitherto 
characterized the interpretation of these tests. 
Unfortunately, the history of pure research 
on perception is marked by the brass instru- 
mentalist’s usually neglect of dynamic factors. 
This degree of neglect is equalled only by the 
“projective” clinician’s neglect of purely cog- 
nitive factors, a generation later. In 1944 
(4) the senior author of this article attempted 
a comprehensive statement of the factors in 
misperception inherent in the individual. 
Various limited experiments led to the de- 
velopment of this statement into a clearer 
theoretical position, recently stated systemat- 


ically (5), which it is the object of the present 
experiments to test. 

Briefly, it is recognized that the misper- 
ceptive response is a function of conditions in 
the stimulus situation as affected by (a) abil- 
ities, (4) experience (information, skills), 
and (¢) the dynamic needs of the subject (S). 
The analysis, as far as this simple but essen- 
tial threefold division, is summarized in Fig. 
1, which was primarily intended to “place” 
misperception test processes in due relation to 
other personality tests. 

In short, misperception can arise equally 
from failure of relation eduction (abilities: 
Spearman’s noegenesis [20]), failure of 
knowledge (either reproduction or experi- 
ence), and from dynamic distortions. The 
more novel and specific feature of the theory 
was the further analysis of the dynamic fac- 
tors in misperception into three main varie- 
ties and various subvarieties, as summarized * 
in Fig. 2. 

The precise meaning ascribed to each of 
these terms must be defined, as follows, if 
the subsequent experimental design is to be 
appreciated. 

Autistic distortion. Here the cognitive field 
is distorted better to agree with the person’s 
hopes and fears. If only his hopes, then it 
corresponds to a failure of the reality prin- 
ciple under the pressure of the pleasure prin- 
ciple; but if of fears also, then something 
broader than the pleasure principle must be 
hypothesized. 

Press compatibility distortion. This is best 
illustrated in existing experiment by John- 
son’s finding (12) that persons rate others as 
less friendly when themselves are in depressed 
moods than when in other moods, and, on a 


1The reader familiar with the history of this 
particular approach will, incidentally, recognize that 
this is a more comprehensive statement of what was 
initially approached (4) as three forms of projection: 
Naive projection from personal experience (“Poor in- 
formation” here); Press projection (“Press compatibility” 
here); and True projection (under “Defense mechan- 
isms” here). 
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less secure experimental basis, by Murray’s 
(16) report that frightened persons (four 
cases) rate others as more aggressive. The 
cognitive field is here distorted to fit in with 
(not to agree with, as in [1]) the individual's 
and desire, in the sense of making 
the emotion logically, socially understand- 
able. The need being satisfied is thus not 
the pleasure principle but the self-sentiment 
with its anxiety for self-consistency and cor- 
rectness. The term “press compatibility” 
simply defines the operation of striving for 
a compatibility with the external stimulus, 
“press.” 


emotion 


force or 


MIsPERCEPTION-T Est Processes IN RELATION TO PERSONALITY TESTING 


Ego defense mechanism distortions. The 
defense dynamisms do not require definition 
(11), and their modes of distortion of the cog- 
nitive field are well known. In general, the 
individual needs to misperceive the external 
situation in order to lessen the tension of 
repressing the rejected, unconscious drives, of 
which tension he is nevertheless unaware. 
These act only through unconscious dynamic 
forces whereas the two first, as far as is 
known, act through conscious needs. 

According to our hypotheses, the above five 
factors—deficient ability, deficient knowledge, 
autism, press compatibility, and ego defenses 
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—cover most misperception resident in the 
individual. An equally exhaustive analysis 
could probably be made of factors in the 
stimulus, of S instead of O in the general 
formula R = f(O,S), where R is the per- 
ceptual response. The classical work on color 
contrast effects, the Miiller-Lyer illusions, 
camouflage, regression to the real, etc., would 
fall in this category. 

Only one of the stimulus conditions seems 
to us of sufficiently outstanding importance 
for misperception tests io need mention here, 
and then only because it entangles very specifi- 
cally with dynamic distortion.. This concerns 
the neglected issue of principal and ground in 
the test presentation of pictorial themes. Prac- 
tically all such pictures necessarily present one 
person (the principal) with whom the S 
may identify, project, or at least maximally 
interest himself, and a field of surrounding 
objects and persons which are not intended 
to be so focal. The S’s estimation of (say) 
the aggressiveness of the principal will ob- 
viously depend upon whether he chooses, for 
example, to exercise projection on the latter 
or on persons in the “ground,” and even on 
his intellectual understanding of what the 
ground is. There are literally hundreds of 
“projective test” experiments in which this 
has not been recognized, studied, or con- 
trolled. In this research we mention it only 
to control it, being interested in factors in 
the S. 

In what follows we shall not experiment 
with the full 1o or 12 defense dynamisms 
(11), which would require a_ prohibitively 
large experiment, but shall take a representa- 
tive three, namely, rationalization, phantasy, 
and projection. The hypothesis we finally set 
out to investigate is therefore that the nature 
and magnitude of misperception effects gen- 
erally are to be accounted for by seven inde- 
pendent factors within the S, two being cog- 
nitive (intelligence, information), two oper- 
ant on consciously accessible dynamic traits 
(autism, press compatibility), and three being 
representative of the defense mechanisms. 
Our theory is that these are independent influ- 
ences, in the sense of being independent of 
one another and of the particular dynamic 
traits, ¢.g., sex, aggression, upon which they 
act (except for the above restriction on un- 
conscious traits). 
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An assumption of this kind is implicit in 
many psychoanalytic writings but it has never 
been tested, nor have its consequences been 
explicitly followed up. Similarly in the “prac- 
tical” field of clinical misperception tests, no 
cognizance has been taken of the fact that 
an increase in strength in one and the same 
drive might produce opposing, neutralizing 
misperception effects through the independ- 
ent action of these mechanisms. Incidentally 
the only senses in which “independent” has 
meaning experimentally are those of R 
technique, i.e., varying independently from 
person to person, and of P technique (9), 
i.e., varying independently within one person. 
The first proof, attempted here, is of the 
former, i.e., that when one manifestation 
of autism, for example, is greater in person 
A than B, then all other autism manifesta- 
tions are also greater, but other mechanisms 
vary independently. 


ResEarcH Desicn 


The hypothesis just reached may be alter- 
natively stated: the extent of misperception 
is proportional to the strength of the dynamic 
need and the strength of the various misper- 
ception processes. Hereinafter the seven in- 
fluences defined above will be called processes, 
while dynamic traits of any kind will be 
called needs. It is necesary to recognize 
that the strength of a need is itself a func- 
tion of (a) the strength of the stimulus situa- 
tion, (4) the deprivation leve! normally ob- 
taining for that individual in his life situation, 
and (c) whatever factors in the individual 
cause his drives, etc. to be inherently greater 
or less than another's. 

The last two cover what we call the 
“strength of the individual’s dynamic traits.” 
The first is also normally included therein, 
in so far as it concerns the fixed provoca- 
tions of the individual's life situation, but it 
may refer also to temporary stimulus situa- 
tions, provocative of moods, etc. In our ex- 
periment, as far as needs are concerned, we 
propose to consider their strength in terms 
of both (a) individual differences in inherent 
strength and (4) effects producible tempo- 
rarily by special stimulation. It has seemed 
to us sufficient for initial generalization about 
needs to represent them in this experiment by 
at least two. We accordingly chose self-asser- 
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tion (dominance) and sex, though some 
minor representation of other dynamic traits 
(play, superego) are also recognized. The 
choice of sex and self-assertion was directed 
also by the fact that these ergic patterns have 
so far been most clearly confirmed in recent 
factorial experiment (7, 5). 

The independent variables to be entered 
in the experiment are therefore as follows: 
(a) individual differences with respect to 
measures of self-assertion and sex, (4) indi- 
vidual differences on the seven process vari- 
ables, and (c) individual differences on tem- 
porary stimulus situations bearing on sex and 
self-assertion. It is presumed that changes in 
the dependent variable, amount of mispercep- 
tion, will depend on these. Another possible 
source of variation, direction of interpretation 
to principal or ground in the submitted per- 
cept, has been held constant, as indicated, by 
referring the S always to the principal. 

The treatment of these variables in the ex- 
periment is a fundamentally novel one, re- 
cently set out elsewhere (9) as the condition- 
response design. This combines the capacity 
of factor analysis to deal with several vari- 
ables along with the classical experiment’s 
control of a pair of variables. In this case 
it allows the inherent drive strengths and the 
process strengths to vary in the population 
as they will, while it controls the provocative 
“temporary” stimuli to sex and self-assertion. 
In order to pick out the distinct drives and 
the distinct processes as independent factors 
(if they exist as in our hypothesis) it is neces- 
sary for each to be represented in the experi- 
ment by at least two “markers,” as we shall 
call them. For example, we represented the 
sex drive, not by a single variable, but by 
two major and some lesser variables. For a 
common factor requires at least two well- 
loaded variables to distinguish it as a factor, 
and these two variables must correspond with 
the psychological character hypothesized in 
the factor if the hypothesis is to be confirmed. 

Thus the set of variables to be intercorre- 
lated for the eventual factorization included 
(a) measures of the strength of sex and self- 
assertion in themselves, (4) measures of these 
two needs reacting in all possible combina- 
tions with the seven process variables, (c) 
measures of the “pure process” variables in 
abstraction from these two needs, and (d) 
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measures of the strength of the stimuli for 
sex and self-assertion. The last were alone 
“controlled” and were therefore distributed 
randomly throughout the sample. The use 
of (a) and (c) above will be perceived to be 
similar to what has been called criterion 
analysis (distinguished from criterion rota- 
tion [9]). 

Any complicatedness in this design will be- 
come clearer as we now describe actual pro- 
cedures. Most attention should probably be 
given here to the tests by which we proposed 
to measure the various misperceptive proc- 
esses for these devices, already set out and 
proved in other researches (2, 4,5), which are 
likely to be of practical value to clinicians, in 
view of the present findings. 

Essentially, the experimental procedure, as 
far as the “process” measurement was con- 
cerned, consisted in presenting S with a pic- 
ture and asking for an interpretative response. 
But selective rather than inventive (open- 
ended) responses were invited and eight al- 
ternatives were given, each representing a 
predominance of one of the processes. (The 
number became eight rather than seven be- 
cause the “information” category was split 
in two as shown below.) Twenty such pic- 
tures were presented at a session, permitting 
a score up to 20 on any one process. (We as- 
sumed that with as many as eight alterna- 
tives no significant spurious negative correla- 
tion among the alternative processes would 
be produced.) A distinct set of 20 was pre- 
sented on a second occasion, thus providing 
the second of the marker variable scores for 
each of the processes.” Twelve of the 40 were 
standard TAT cards; the rest were similar in 
style. 

This part of the test procedure thus cov- 
ered eight processes, defied as follows: 


Hypothesized Misperception Processes 


1. Cognitive misperception through deficient in- 
telligence (Naive Misperception No. 1). An intelli- 
gence test was used here. 

2. Cognitive misperception through deficient 
knowledge or ability to reproduce associations (Naive 
Misperception No. 2), (@) in relation to the field. 
Measured by a test of general life experience. 

3. Cognitive misperception through deficient 


2 Actually 22 pictures were presented on each occa- 
sion, two being concerned with extraneous issues, 
namely with noting any effects due to specificity of 
occasion. 
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knowledge or ability to reproduce associations 
(Naive Misperception No. 3), (4) in relation to the 
principal person. Measured by a test of self-insight 
—the reciprocal of the difference, regardless of sign, 
between S’s own estimate of his sex and self-assertive 
erg strengths and the strengths as given by the 
experiment. 

4. Autistic misperception. (Test: see below.) 

5. Press compatibility misperception. (Test: see 
below.) 

6. Ego defense misperception, 
(Test: see below.) 

7. Ego defense misperception, (4) Rationalization. 
(Test: see below.) 

8. Ego defense 
(Test: see below.) 


(az) Projection. 


misperception, (c) Phantasy. 

Naive misperception through deficient ex- 
perience (knowledge, reproduction) is split 
into two (Naive 2 and 3) because Sears’ (18, 
19) work may be interpreted as indicating 
especial importance, for misperception, of ig- 
norance about the self. We inserted (later) 
a special criterion measure of ignorance about 
the self, and tested here in the misperception 
tests the hypothesis that this would show 
itself most emphatically in misperceptions in- 
volving the principal person in the picture, 
as one might reasonably expect. This is the 
only place in which any analysis is attempted 
here of principal-ground differences. A de- 
scription of the items by which the various 
misperceptions above were actually measured 
is set out in the next section. Here we pro- 
ceed with the over-all design. 

The first 20 pictures all had interpretations 
(“reasonable” ones) involving greater or 
lesser sex perception, while the second 20 
permitted greater or lesser score on self-asser- 
tion. 

This suffices to indicate the general design: 
16 tests of eight processes, simultaneously 
yielding two measures of strength of two ergs 
and two markers for each of the eight proc- 
esses. In addition, five have process measures 
(autism, information, phantasy, etc.) which, 
as indicated above, were used for each erg, 
plus an intelligence measure. Two of the dy- 
namic processes could not be brought into 
pure, “criterion” process form. The pure 
ergic strength measures will be described 
later. One of the ergs, self-assertion, was 
additionally measured by giving to the Ss 
what seemed the six most relevant factors of 
the 16 Personality Factor Questionnaire (6), 
which in addition provided a measure of five 
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other dimensions of personality needing to 
be kept in view, if only to prove the inde- 
pendence of the present needs and processes 
of these general temperament and character 
dimensions. Actually, one would reasonably 
hypothesize that proneness to using the de- 
fense mechanisms generally might be asso- 
ciated with C factor, and the use of projec- 
tion with the paranoid factor L. 


MIsPERCEPTION Test DeEsiIcNs 


Examples will now be given of a test item 
contributing to each of the variables, along 
with a statement of design. For purposes 
of illustration they are more obvious, less 
veiled, than in the body of the tests as pre- 
sented to Ss. 

Incidentally, the definition of the two needs, 
sex and assertion, running through these 
items is the usual dictionary one, aided by 
factor-analytic evidence of the tendencies in 
dominance (3, 6, 7) and in sexuality (8). 
Thus the former includes self-assertiveness, 
boasting, conceit, breaking of conventional 
rules, pride in attainments, liking to master 
things and people; while the latter includes 
desires to make love to a beautiful woman 
(for men), enjoyment of good movies and 
plays, enjoyment of love novels, etc. 

The operations of misperception acting 
through these motivations can best be de- 
fined with the illustration of each, as follows: 

Cognitive misperception (Naive Misper- 
ception No. 1). On the basis of the hypotheses 
set forth in this paper, one would expect 
cognitive misperception, a result of deficient 
intelligence, to correlate negatively with the 
intelligence factor. The following items, de- 
signed to measure this tendency, depict rela- 
tionships which do not exist in the pictures 
and which would appear highly improbable 
or impossible to an intelligent person: 


1. The young woman and the old woman are 
watching a person do what they have forced the 
person to do. 

2. The girl in the picture is listening attentively 
while her mother tells her the facts of life. 

Naive misperception (field) (Naive Mis- 
perception No. 2). Theoretically, one can 
separate a person’s intelligence from his level 
of “general information.” Many differences 
of interpretation on so-called projective tests 
certainly reflect nothing more than differ- 
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ences of information. To test this hypothesis 
most clearly an investigator would need to 
construct items around esoteric information. 
Because of the multiplicity of variables here 
being investigated, all of which operated on 
the same picture, it was very difficult to select 
or design pictures to allow all variables to 
function. Therefore, few cues of this esoteric 
nature were present in the pictures. Never- 
theless, an item designed to measure misper- 
ception of this kind was included for each 
picture. 

In the picture shown for each group of 
items which included the following, such in- 
formation was included in the pictures which 
would render these statements incorrect. 

3. The man clothed in the long robe is Jesus who 
men to throw down their arms and 
(The men were holding walking 


is forcing the 
be peace ful. 
staffs. ) 

4. The girls standing around the life guard are 
attracted to this man very much since he's a movie 
star at the beach in Hollywood, California. (A 
sign appears in the picture which reads “Coney 
Island Bea h rn 
(Naive 
Misperception No. 3). To ensure that naive 
misperception is the chief influence at work, 
it is necessary to choose examples where ade- 
quate knowledge would rule out interpreting 
by naive projection of the self. Thus: 


Naive misperception (persons) 


5. The minister preaches to them (his congrega- 
tion) on vice because he, like other people, is a 
dominant person and this is an opportunity for him 
to express his authority. 

Here it is assumed that anyone familiar 
with the religious training and ideals of the 
ministry would hesitate to interpret the be- 
havior as that of deliberate dominance for 
personal gain, no matter how dominant he, 
himself, might be by nature. Similarly the 
knowledgeable person would avoid the mis- 
perception of considering the following to be 
motivated by romantic love: 


6. The missionary is talking to them (a group of 
African natives) about love because she is attracted 
to these men just as all girls are attracted to boys. 

The positive score on naive projection is 
italicized in each case. 


Autistic misperception. Although misper- 
ception can be made to operate more read- 
ily upon a field than a principal, our design 
necessitated misperception with regard to 


both. Since such misperception is depend- 
ent upon identification with the principal, it 
might initially have been more desirable to 
investigate the relation with the field, and ex- 
clude that of the principal. This-desire was 
opposed by the need for as much similarity 
as experimentally feasible between our experi- 
mental design and the procedure generally 
followed by clinicians in using tests employ- 
ing misperception, or projective techniques. 
7. The woman in the picture is dominating the 


man because he wants her to ... he enjoys being 
dominated. 


Here the wishes of the dominant person are 
autistically satisfied if others invite his com- 
mand. 

8. The girl in the center of the children is a@ 


very attractive girl who is a social success ...@ 
very popular girl. 


Press compatibility misperception. Here the 
dominant person is more likely to believe oth- 
ers to be submissive and vice versa. Although 
it is somewhat difficult to distinguish the 
effects of this mechanism from autistic mis- 
perception, since both favor (in the dominant 
person) an external situation inviting dom- 
inance and (in the submissive person) an 
external situation which is dominating, the 
distinction between the two was made on 
the following basis. Misperception occasioned 
by autism is by definition a pleasurable ex- 
perience. This process results in the satisfac- 
tion of dynamic needs in S. Press misper- 
ception, however, while also satisfying a need 
(that to maintain one’s self-respect and secu- 
rity) is a more urgent process, one akin to 
rationalization. For here the individual is 
confronted with a mood state which he must 
explain in terms of his psychological logic, 
based upon his past experience. This element 
of necessity has been accepted as a point 
of difference between the two processes and 
magnified in the design of the items. The 
essence of the difference, as accepted, is joy 
and satisfaction in autism vs. duty and ra- 
tionalization in press. 


9. The man kneeling at the boy’s side is dominat- 
ing the boy because the boy is a very submissive 
person which necessitates that he be led. 

10. The woman in the picture is becoming amor- 
ous because of the extreme attractiveness of this 
man. 
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Projection. In true projection it is neces- 
sary to make clear in the item content that the 
behavior in question is socially reprehensible; 
for projection, by definition, acts upon the 
repressed. 

11. The woman is telling the man what to do 
because, although to be such is bad, she is a dom- 
inant person as are so many people. 

12. The woman in the picture is a woman who 
enjoys and seeks out romantic love and should be 
criticized for this immoral behavior. 


Rationalization. Little discussion is re- 
quired to clarify this process. Again one has 
to choose, if possible, rather antisocial be- 
havior rooted respectively in dominance and 
romantic love and set out good rationaliza- 
tions for it. 

13. The older man is the boy’s father who domi- 
nates the boy for his cwn good. 

14. The woman in the picture is in love with the 


man and wants more than anything else to marry 
him ... but she wants this only for his good. 


Phantasy. Once more one must choose rep- 
rehensible behavior. However, this mecha- 


nism is concerned solely with symbolic satis- 


faction for here the individual “dreams” of 
the satisfying behavior. This misperception is 
occasioned by S’s tendency to impute such a 
mode of satisfaction to the principal. 


15. The girl behind the tree is the running girl’s 
nurse who dreams of completely possessing her 
patient and making her do anything she wants her 
to. 

16. The woman in the picture dreams of being 
the most attractive, beautiful, sought-after woman 
in the world with all men making love to her at 
her desire3 


EXPERIMENTAL PROCEDURE 


The tests illustrated above were administered to 
67 undergraduate women at the University of IIli- 
nois, in three meetings, each of three consecutive 
hours. Twenty-two pictures for the first erg were 
given at the first, 22 for the second erg at the second, 
and the essential “process” and pure erg measures, 
including the 16 Personality Factor (P.F.) test, at 
the third. 

It remains to describe the application of the 
stimuli intended temporarily to increase the 
strength of the two needs, differentially for various 
Ss. It was planned to strengthen the transient 
moods for both (a) dominance and (4) sexuality 


8 Note that here as elsewhere it is not possible to 
economize items by setting up the dominance and the 
romantic responses as alternatives in a single item, 
since this would produce a spurious negative correlation 
between them. 
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by selected movies. A different “dose” was given 
different Ss and scores on the independent indi- 
cator were calculated as the inverse of the number 
of movies each S saw. The following movies were 
used to provoke a mood of dominance: Teddy the 
Rough Rider, a film depicting the later political 
life of Theodore Roosevelt; Romance of Radium, 
the story of the success of the Curies and Becquerel 
in their struggle to isolate radium; Story of Alfred 
Nobel, the ideals and success of Alfred Nobel (once 
again a film in which persistent expression in the 
face of many barriers is rewarded) ; Give Me Liberty, 
a film depicting the pre-revolution era in the Ameri- 
can Colonies in which the bulk of the film was 
devoted to the main actor’s presentation of Patrick 
Henry’s famous speech; One Against the World, the 
story of Dr. Ephraim McDowell who, while con- 
fronted with a mob threatening to hang him, per- 
forms the first major operation on a human being. 

The following films were used to provoke a mood 
of romance: The Schumann Story (an adaptation 
from Song of Love), the heartwarming story of 
Robert Schumann's immortal compositions and the 
devotion of his wife Clara; Romeo and Juliet, an 
edition of the Metro-Goldwyn Mayer 1936 produc- 
tion. (This edition limits itself to those sequences 
which deal directly with the love story of Romeo 
and Juliet.) 

These were played to the Ss immediately before 
responding to the tests, and, in accordance with the 
condition-response design (9), the condition (“ro- 
mantic love” in the first case) was applied in highest 
degree to one fraction of the population (one quar- 
ter), a lower degree to a second fraction, and so 
on, giving every person a score on the range 1 to 
4. The population was subjected on the second ses- 
sion to different degrees of provocation of self-asser- 
tion. The scores for the individuals on these two 
provocations were correlated in what we have vari- 
ously called “criterion,” “independent indicator,” or 
“condition” variables. 

As indicated earlier, among the various significant 
analyses of dynamic need strength are those into 
temporary mood and “permanent” strength, and into 
conscious and unconscious. Idealfy we would have 
liked to complete our design by estimates of the 
fraction of the strength of the given ergs in each 
individual tied up in expression through uncon- 
scious complexes. But any prolonged psychoanalytic 
investigations of our Ss were unfeasible and would, 
under most circumstances, have led to only dubious 
quantitative evidence. 

Accordingly, our experimental work concluded 
with 48 variables, as follows: two sets of misper- 
ception process variables, eight in each; one ergic 
stimulus-level variable; six personality factor meas- 
ures from the 16 P.F. test (Factors B, C, E, H, O, 
and L), one of which, E, was also used as an ergic 
marker; one measure of frequency of checking vari- 
ables, four additional ergic markers (items designed 
to obtain measures of drive strength without the use 
of process or misperceptive mechanisms), 21 essen- 
tial process measures (non-misperceptive measures 
of the processes under investigation). 

The technicalities of the factor analysis can be 
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briefly defined and dismissed as product moment 
correlation; multiple group extraction with three 
iterations required to stabilize communalities; resid- 
uals indicating a minimum of eight factors to be 
extracted; and a reasonably definite, simple-structure 
resolution (50 per cent of variables:in the +.10 
hyperplane on an average) obtained by blind rota- 
tion after 13 over-all rotations.* 

The rotated (simple structure) reference vector 
matrix, from which the factors described below are 
abstracted, is set out in Table 1. Of the remaining 
matrices,> only that giving the direction cosines 
among the factors (reference vectors) is given here 
(Table 2), and it will be seen that these factors are, 
within the limits of error, independent or uncorre- 


lated. 


Nature of the Factors 


As usual, we shall attempt to arrive at the nature 
of the factors by listing the highest loaded variables, 
out of 48. Where the loading is negative the verbal 
label of the variable will be reflected (e.g., sub- 
missiveness instead of dominance) to be consistent 
with description of the positive pole. A “coupled 
title,” e.g., Phantasy Dominance describes the process 
variable for phantasy as measured in one set of eight 
misperception scores, the set all concerned with the 
erg of self-assertion or dominance. All “coupled” 
titles thus refer to scores of misperception on twenty 
pictures. Marker Dominance refers to the intended 
pure measure of dominance, unmixed with any 
process, and so on. 


Factor I, Phantasy 
Variable Loading Definition 
42 82 Phantasy Dominance 
31 53 Marker Mating 
: Rationalization Mating 
41 Phantasy Mating 
43 Phantasy Play 
17 Factor O (Cattell’s 16 P.F.) 
40 Autism Fear (low) 
30 Rationalization Dominance 


Our first factor seems to justify being called 
phantasy, for three of its eight highest loadings are 
phantasy variables and that variable with the high- 
est loading is a phantasy marker variable, phantasy 
dominance. A set of variables connected with the 
mating erg have a lower loading here, but as a 
very clear factor (VI below) already defines this 
drive, we can only suppose a secondary association 
between strength of sex erg and tendency to phan- 


*The authors with to express their great indebted- 
ness to Dr. David R. Saunders for the technical ex- 
cellence of the factorization and rotation which he 
independently carried out in the Statistical Service 


To save printing costs, the rest of the matrices— 
correlation, unrotated factor, and transformation—have 
been deposited with the American Documentation Insti- 
tute. Order Document No. 3485 from American 
Documentation Institute, 1719 N St. N. W., Wash- 
ington 6, D. C., remitting $1.00 for 35-mm. micro- 


film or $1.20 for 6 by 8 photocopies. 


tasy. The best hypotheses here, in view of the recent 
demonstration of an independent, narcissistic, sex 
drive factor (7, 8), is that proneness to the defense 
of phantasy is specifically associated with the strength 
of this narcissistic component in sex rather than with 
sex as such. 


Factor II, Dominance (Self Assertion) 
Variable Loading Definition 
26 81 Marker Dominance 
35 51 Phantasy Dominance 
42 47 Phantasy Dominance 
22 45 Factor E (Cattell’s 16 P.F.) 
Dominance 
18 43 Phantasy Mating 
10 —42 ~ True Projection Dominance 
(little projection from the 
unconscious) 
—4I Superego Strength Domi- 
nance (little superego cen- 
sorship of dominance)® 


This factor appears to be the erg of self-assertion 
or, in other words, the dispositional trait of dom- 
inance. For it contains in the highest seven loadings 
six of the variables concerned with dominance be- 
havior, one of them, the highest, being a marker 
for dominance. Further, the measure of the person- 
ality factor for dominance, on the 16 P.F., comes 
out here, and only here, with a significant and 
substantial loading. 


Factor III, Freedom from General Neuroticism 
(A second-order factor from Cattell’s 16 P.F. Test.) 
Variable Loading Definition 

23 79 ~=- Factor C (Cattell’s 16 P.F.) 
(high emotionality) 
Factor O (Cattell’s 16 P.F.) 
(high anxiety) 
Factor H (Cattell’s 16 P.F.) 
(withdrawn behavior) 
Factor E (Cattell’s 16 P.F.) 
(submissiveness) 

Factor L (Cattell’s 16 P.F.) 
(paranoid tendency) 

Superego Strength Domi- 
nance (censure of domi- 
nance) 

Marker Dominance (low) 
(submissiveness) 


This factor is clearly a second-order factor found 
among a certain set of primary personality factors 
in Cattell’s 16 P.F. test. The C factor (in its nega- 
tive direction) has previously been considered the 
pattern of general emotionality and presumably 
neuroticism, but Eysenck’s recent claim that the 


®Four groups of items, two groups for each erg, 
were devised to measure the strength of the superego. 
The format and principal of quantification of these 
groups was quite similar to “paired comparisons.” Vari- 
ous unfavorable statements of diverse content about 
a person were paired with groups of such statements 
centering in content about our two ergs. Any indi- 
vidual’s score was the number of times he chose an 
alternative which contained our ergic content. 
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TABLE 2 
Drmection Costnes AMONG THE Factors (A'A) 
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neuroticism syndrome is a second-order factor, 
largely loading C, but also O (anxiety) and others, 
receives definite support from this finding. No 
single defense mechanism or erg seems specifically 
tied to this pattern, however, and it is an irrelevant 
finding from the standpoint of our main study. 


Factor IV, Naiveté, Stupidity (Obverse of General 
Intelligence) 
Definition 
Naive-Misperception 
Dominance 
Naive-Misperception 
Mating 
34 Naive-Misperception 
Dominance 
48 Naive-Misperception No. 
Mating 
Press Misperception 
nance 
40 Autism Misperception Domi- 
nance 


Variable Loading 
15 77 


19 67 


Domi- 


Clearly the composition of this factor indicates 
the name we have given it, for here the four high- 
est loadings are for the measures of naiveté. How- 
ever, it is noteworthy that we have not succeeded in 
separating deficient information from deficient in- 
telligence as a cause of misperception. Possibly a 
larger population would reveal a fresh factor cor- 
responding to one of these. 


Factor V, Autism 
Variable Loading Definition 
29 77 Autism Mating 
40 68 Autism Fear 
45 54 Autism Play 
44 Rationalization Play 
35 Autism Misperception Mating 
True Misperception Projec- 
tion Mating 
Superego Strength Mating 
(low) 


Rationalization Dominance 


This factor appears to represent the process of 
autism, for the three variables which load it the 
highest are autism variables; of those eight variables 
with the highest loadings, four are of this autistic 
nature. Also, in no other factors did more than 
one of the autism variables appear among the sig- 
nificant loadings. 
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Factor VI, Mating 
Variable Loading Definition 
31 76 Marker Mating 
24 66 Marker Mating 
5 —45 Low Superego Strength Mat- 
ing (little censorship of 
sex) 
Low Superego Strength Mat- 
ing (little censorship of 
sex) 


—40 


As hypothesized, there existed in our correla- 
tion matrix an independent source of variance cor- 
responding to the mating erg. The two variables 
with the highest loadings on this factor are the two 
tests designed as markers for the mating erg. In 
addition, the next two highest variables on this 
factor deal with the amount of overt sex expression. 


Factor VII, Rationalization 


Variable Loading Definition 


55 Rationalization Misperception 
Dominance 

3 45 Superego Strength Mating 

27 45 


Rationalization Misperception 
Mating 


Factor VII, rationalization, has been so identified 
because two of the three highest loadings are ra- 
tionalization variables, one being the highest. Of 
the six rationalization variables in the matrix, three 
load this factor significantly and there is no rival 
grouping of them anywhere else. Some association 
of sex drive strength with rationalization tendency 
seems indicated. 


Factor VIII, True Projection 
Variable Loading Definition 
21 59 Press Misperception Mating 
53 True Projection Mispercep- 
tion Mating 
Phantasy Misperception Mat- 
ing 
True Projection Mispercep- 
tion Dominance 
Naive Misperception Mating 
: No. 3 
41 : Phantasy Mating 
37 Factor L (Cattell’s 16 P.F.) 


Factor VIII, which is loaded by the two projec- 
tion tests, is the only dynamic factor which has 
appreciable loadings of these two variables. In other 
words, all of the true projection variables in the 
matrix load this factor significantly. Again there is 
some association with strength of sex drive, and this 
time it is considerable. Conceivably we may have 
to recognize these intrusions by the hypothesis that 
strength of libido is in general correlated with prone- 
ness to defense mechanisms, especially phantasy, 
rationalization, and true projection (Freud's concept 
of the “psychosexual constitution”). The same asso- 
ciation with defenses might not be expected to occur 
with any drive not subject to such constant social 
inhibition. 

It is interesting to note that the only significant 
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loading of personality factor L, paranoid tendency 
(apart from its second-order factor appearance), oc- 
curs here, supporting the clinical hunch that the 
projection defense is especially favored in the para- 
noid make-up. 


SUMMARY AND CONCLUSIONS 


A factorization of 48 variables, compris- 
ing various measures of misperception proc- 
esses in interpreting 40 pictures, as well as of 
ergic stimulus conditions and various criter- 
ion variables has been carried out for a popu- 
lation of 67 in an experimental design known 
as condition-response factor analysis. 

The hypotheses guiding the experiment 
were that we should find at least eight dis- 
tinct misperception process factors, as well as 
independent factors for the two ergs involved 
and perhaps also for the stimulus conditions. 
Omitting possibilities of extracting further 
factors of very small variance, eight factors 
have actually been found, which are inter- 
preted as follows: Self-Assertive erg (domi- 
nance drive), Mating erg (sex drive), 
Cognitive -Disability (naive misperception), 
Autistic Misperception, Rationalization, Phan- 
tasy, True Projection, and General Neuroti- 
cism, a second-order personality factor un- 
connected with the misperception measures. 

The cognitive sources of misperception— 
low intelligence, defective life experience, 
defective comprehension of the self relative to 
others—therefore operate, at this degree of 
fineness of factor extraction, as a single in- 
fluence. This agrees with our earlier (4) 
hypothesis of naive projection, better named 
recently as naive misperception. 

All the five hypothesized dynamic sources 
of misperception except Press Compatibility 
have been demonstrated as unitary tend- 
encies, with the characteristics assigned to 
them. The only high loading of a press com- 
patibility variable is in the factor interpreted 
as True Projection, which suggests for in- 
vestigation that a single underlying mecha- 
nism may be involved both in projecting an 
unconscious drive and in reconciling external 
facts with conscious internal moods, namely 
a need for self-consistency. If so, the under- 
lying process would be better labelled as 
self-saving. 

If there is a single factor covering all de- 
fense mechanisms, as distinct from other 
dynamic sources of misperception, it is too 
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faint to show at this magnification, i.e., with 
the standard errors of a population of 67. 
Nevertheless, there is some association of 
phantasy, rationalization, and projection with 
high libido strength, suggesting that a second- 
order factor (psychosexual constitution) may 
underlie a general tendency to use such 
defenses. 

The misperception processes are demon- 
strated to act independently of attachment to 
any particular erg. The defenses generalize 
beyond the misperception situation, and the 
ergs stand out (at least the two of largest 
variance and for which definite variables 
were available) independent of each other. 
This is a demonstration of ergic unity which 
definitely supports the recently available evi- 
dence of the reality of unitary ergs (3, 7, 8). 

Nevertheless there are some suggestive 
loadings to the effect that certain personality 
dimensions, if not certain ergs, are prefer- 
entially associated with the use of certain 
misperception processes. Notably there are 
indications of (4) phantasy with high sex, 
and perhaps dominance (E), and with the 
anxiety factor (O); (4) rationalization with 
superego strength; and (c) true projection 
with paranoia (L factor). 

The second-order factor found among per- 
sonality factors has no definite bearing on this 
research, but seems to correspond with one 
found earlier and considered to be general 
neuroticism. It is perhaps surprising, how- 
ever, that this has no particular association 
with stronger tendencies to use the ego de- 
fense mechanisms generally. 

No evidence was found of significant 
heightening of ergic strengths, in terms of 
the recency of exposure to stimulus situations 
specific to those ergs. The conclusion must 
be drawn that the experimental stimuli used 
here need to be changed or amplified and the 
degrees of quantification increased in any 
future experimental attempts to increase mis- 
perception effects by temporarily increasing 
drive strengths. 

The presence of the process variables, 
notably phantasy and projection, in the 
ergic strength factors shows that the amount 
ofmisperception is proportional to the 
strength of the overt erg. This agrees with 
our previous finding by another method (5). 
However, it does not deny that the misper- 
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ception may also be proportional to the uncon- 
scious, covert strength (about which we had 
no evidence) by the principle of overt-covert 
proportionality (5). 

The current clinical practice of assuming 
“projective” misperceptions to be proportional 
directly to (or some function of) the strength 
of unconscious motivation and inversely to 
overt expressions’ is shown to be funda- 
mentally erroneous, in two ways. First, the 
proportionality is direct to overt drive 
strength. Secondly, and far more important, 
the misperception is not due to a single 
process, projection, but to several dynamic 
misperception processes, which, superposed, 
frequently act in differing direction. 

The constructive implication of the above. 
is that the estimation of personality factors, 
principally the strength of ergs or other dy- 
namic needs of the individual, through mis- 
perceptive tests, should be considered in terms 
of the factorial specification equation. Tests 
as factorially pure as possible should be pre- 
pared for the separate misperception processes 
in order that their scores may be combined 
to give the best estimate of a given personality 
trait. At present, the confusions of several 
processes in a single “projection” produce 
cancellations which end in a very small, 
generally insignificant correlation of the mis- 
perception score with a personality score. 

In practice this would indicate the use 
of a set of tests, each measuring a pure 
process and a pure erg, in all possible combi- 
nations. The estimate of ergic strength would 
have the advantage of being based on several 
misperceptive processes equally weighted in- 
stead of, as now, being based on projection 
only, or on a mixture of projection, phantasy, 
etc., in which the different processes undo 
one another’s effects. 

While this demonstration of a particular 
plurality of processes, and of the independence 
of the processes and drives, confirms our 
main hypotheses, it also opens up immediately 
a fresh set of significant problems, notably: 

a. Can the same unitary and independent 


assumption is occasionally denied—but only 
4 vacuum in thought unrelieved by any in- 
theory! Rorschach or TAT user deals 
with complex resultants of perceptual processes, but 
this does not absolve him from making clear assump 
action of the 


? This 
to reveal 


The 


NW KI 
telligibie 


tions about the nature and direction of 


constituent processes. 
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processes be demonstrated also within one 
individual by P technique? 

b. By exploring the loading patterns for 
inserted items of new kinds, designed on 
various logically definite hypotheses, can we 
carry the conceptions of the defense mecha- 
nisms, etc. to a fuller understanding than 
has been achieved by non-quantitative clinical 
methods? 

¢. Can the same independence be demon- 
strated for other processes and ergs than 
those tried out here? 

d. As a matter of importance in practical 
routine, what effects can be produced in the 
total misperception through manipulating 
the principal and ground relations on which 
the processes act. The current phrase “re- 
sponse to unstructured materials,” is as mis- 
leading as the term “projective”; for, with 
the possible exception of our sixth process, 
phantasy, the response is to structure, not 
to chaos, and it is probable that the individual 
differences in misperception which we meas- 
ure would be more comparable, reliable, and 
meaningful if the possible directions of mis- 
perception were better understood, controlled, 
and varied. 

e. Can more powerful external stimuli 
than we applied produce the expected effect 
of greater misperception with greater tempo- 
rary excitation of a drive, which we failed 
to get? 
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SOME CORRELATES OF TEST ANXIETY 


SEYMOUR B. SARASON AND GEORGE MANDLER 
Yale University 


N previous reports (1, 2, 3) the writers 

have described experiments concerning 

anxiety and learning. In these studies an 
individual's anxiety score was based on re- 
sponses to a questionfaire containing ques- 
tions about attitudes and experiences in three 
kinds of testing situations: the individual 
intelligence test, the group intelligence test, 
and course examinations. 

The present paper has three aims: (a) to 
give a more detailed and up-to-date descrip- 
tion of the questionnaire, (4) to present find- 
ings on the relation of test anxiety to certain 
psychometric and social-class data; and (c) 
to describe the relation of the anxiety ques- 
tionnaire to one concerned with habitual reac- 


tions to frustration. 
THe ANXIETY QUESTIONNAIRE 

Administration 

Table 1 contains several of the items in the ques- 
tionnaire. On the face sheet of the questionnaire there 
was a rather long explanation of what the student 
was to do and why this type of questionnaire might 
be fruitful. Verbally the students were told that “al- 
though we live in a test-conscious culture, we know 
relatively little of how people react to various types 
of test situations. We want you to be as frank as 
possible in telling us what your experience in test 
situations is.” On the reverse side of the last sheet 
of the questionnaire the students were asked to give 
their opinions about the role of tests at Yale. This 
request, in addition to the explanation on the face 
sheet, seemed to produce in the students the reac- 
tion that at last someone was interested in their opin- 
ions on educational practices. More than a few 
students wrote long—and astute—comments on the 
role of tests in modern higher education. On the 
face sheet, and verbally, the students were told that 
their responses and opinions would not be made 
known to any official or instructor of the Univer- 
sity. Several times during the taking of the ques- 
tionnaire an earnest plea was made for the students’ 
frankness and honesty. 


Results 

Between 1949-1950 the anxiety question- 
naire was given to five undergraduate classes 
consisting largely of sophomores and juniors.’ 


1 The writers want to express their sincere apprecia- 
tion to Drs. Buxton, Child, Kimble, and Robinson for 
making class time available for the various studies. 


Four of these classes were introductory psy- 
chology classes, the questionnaire being ad- 
ministered very soon after the term had 
started. The five classes were then considered 
as one group (N=492) and the distribution 
of scores (a) for each of the 42 questions and 
(4) for the total score was then determined. 
The distribution of scores for each question 
was then compared to the distribution of 
total scores in the manner described in the 
first paper (1), except that now the .0o2 in- 
stead of the .03 level of confidence was used 
for determining inclusion or exclusion of a 
question. There were six questions whose 
distributions differed from the total at or 
below the .oo2 level, and they were then 
excluded. The score for each individual was 


then recomputed. An individual's total score 
could then range from o (never above the 


TABLE 1 


SoME OF THE Questions UsED IN THE ANXIETY 
QUESTIONNAIRE 


. If you know that you are going to take a group in- 
telligence test, how do you feel beforehand? 


I 


Feel very confident Midpoint Feel very unconfident 


. While taking a group intelligence test to what ex- 
tent do you perspire? 
Perspire a lot 


Perspire not at all Midpoint 


7. Before taking an individual intelligence test, to 
what extent are you (or would you be) aware of 
an “uneasy feeling”? 

Am very much 


aware of it 


Am not aware Midpoint 


of it at all 


.In comparison to other students, how often do you 
(would you) think of ways of avoiding an indi- 
vidual intelligence test? 


Less often than 
other students 


As often as 
other students 


More often than 
other students 


. When you are taking a course examination, to what 
extent do you feel that your emotional reactions 
interfere with or lower your performance? 


Midpoint Interfere 
a great deal 


Do not interfere 
with it at all 
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median on any question) to 36 (always above 
the median). Figure 1 shows the distribution 
of total scores for the 492 subjects (Ss). 
Reliability coefficients were not computed 
for the present group because of previous 
analyses. A split-half test on one class 


(N=100) produced a correlation (Spearman- 
Brown) of .g1, and a test-retest analysis (six- 
week interval) on another class (N=70) 
produced correlation of .82. 
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Soctat-CLass FInpIncs 


In the earlier studies the extremes of the 
distribution of total anxiety scores were used. 
There were two reasons for such an approach: 
(a) the hypotheses on which the studies were 
based called for the use of extreme groups, 
and (4) it could not be assumed that the 
distribution of scores represented a continuum 
in the sense that an increase in score will 
produce certain predicted relationships with 
other variables. It might be, for example, 
that an individual who gets a higher score 
than another individual does not necessarily 
make a stronger anxiety response but reacts 
differently to the anxiety, even on the ques- 
tionnaire. Put in. another way, two indi- 
viduals might in fact have the same strong 
anxiety tendency but differ in their readiness, 
conscious or unconscious, to reveal it. It is 
a truism that people differ in how they have 
learned to defend themselves against an ex- 
perience of anxiety. By using extreme groups 
it was felt that one was more likely to get 
within each group homogeneity in regard 
to the strength of and reaction to anxiety. 

In the analysis below the results of three 
different comparisons are given. One of 
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these comparisons concerns the two extreme 
groups (high and low anxious) who are 
comparable to the extreme groups used in 
previous studies in terms of position in the 
distribution of scores. In Fig. 1 these are 
given as the HA, and LA, groups. The HA, 
group (N=67) represents cases going into 
the 86th percentile, and the LA, group (V= 
66) represents cases going into the 13th per- 
centile at the other end of the distribution. 
The second comparison concerns an HA, 
group (N=74) which lies between the 71st 
and 86th percentiles and an LA, group (V= 
80) which is between the 13th and 30th 
percentiles. The third comparison involves 
HA(HA,+HA,) and LA(LA,+LA,) 
groups. 

In the previous reports there will be found 
no discussion of the questions: What are 
the sources of the extreme reactions of the 
high anxiety groups? Why should taking 
tests be so upsetting to them? The asking 
of such questions obviously had to be post- 
poned until there was some verification of the 
clinical observation that test anxiety was an 
important factor in learning and performance. 
Such verification is contained in the previous 
reports. 

The following was considered to be a 
fruitful hypothesis for understanding one of 
the possible sources of extreme anxiety re- 
sponses to the taking of tests: The stronger 
the anxiety responses to a test situation the 
sronger the need for intellectual achievement. 
The more an individual feels he should or 
needs to achieve in intellectual tasks, the 
more likely it is that a relevantly challenging 
situation will arouse thoughts of not achiev- 
ing—of failing and then experiencing the 
subsequent punishments. In short, extreme 
anxiety responses indicate that the individual 
is in a danger situation, dangerous in the 
sense that the possibility of failing (with at- 
tendant punishments) is very likely to occur. 

It should be made clear that such an ex- 
planation does not imply that a highly anx- 
ious S necessarily has a stronger need to 
achieve than a nonanxious one. The latter 
may have an equally strong need to achieve, 
but when the need is evoked it does not give 
rise to anxiety responses. In addition, such 
an explanation does not imply that in highly 
anxious Ss the anxiety is only cued off by 
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an achievement drive. The relationships are 
undoubtedly far more complex. 

The discussion thus far has begged the 
question: Why should anxiety and the need 
for intellectual achievement be intimately 
related in highly anxious Ss? The following 
two general predictions, which by their con- 
tent indicate one possible answer to the 
question just raised, were made: 

Prediction 1: In general, LA Ss come 
from those social and economic strata in 
which intellectual achievement is not a pri- 
mary value. The conditions of learning are 
such so as not to make it likely that anxiety 
will be learned in relation to intellectual 
achievement. Many of these individuals 
would not be expected to have a strong 
need for intellectual achievement. Their life 
goals are of such a nature that intellectual 
achievement is not a necessary means for 
their fulfillment. Most of this group are 
probably upper-middle, lower-upper, and up- 
per-upper social class individuals. 

Prediction 2: In general, HA Ss come from 
and economic strata in our 


those social 


society in which conditions producing anx- 
iety about intellectual achievement are likely 


to occur. To these Ss, intellectual achieve- 
ment is important for their post-college 
career. It is a means to a very important end. 
As a group they are socially mobile, contain- 
ing predominantly lower-middle and upper- 
lower social class individuals. 

The files of the Student Appointment 
Bureau were the only sources of information 
available for gathering of data relevant to 
these predictions.? These files, however, did 
not contain the kind of data (family income, 
type of housing, community affiliations, etc.) 
which are necessary for setting up a social- 
class index. There were four kinds of data 
which were considered to be a reflection of 
social-class standing: 

1. Occupation of father. Three occupational cate 
gories were established: (a) executives, brokers, di 
rectors of corporations, Owners or partners in large 


concerns; (4) managerial occupations, professional 
men, owners or partners of small firms, civil service 


2 The writers wish to express their sincere apprecia- 
tion’to Dr. Burnham and the staff of the Yale Student 
Appointment Bureau for their assistance in collecting 
the data. 

8 The occupation of the father was that given by the 
student on his application for admission. For several 
reasons, not all of which need be specified here, using 
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employees, retired businessmen; and (c) manual 
laborers, secretarial and minor civil service occupa- 
uions. 

There was 86 per cent agreement between two 
raters on the classification of the fathers. 

2. Scholarship grants. Scholarships at the univer- 
sity are awarded on the basis of merit and need. 
The student, however, must apply for the scholar 
ship. It was expected that more HA than LA Ss 
would need and be deserving of scholarships. 

3. Father’s education. It was expected that more 
of the fathers of LA than HA Ss would have had a 
college education. 

4. Student’s previous schooling. It was expected 


that more LA than HA Ss would have come to Yale 


from some sort of private rather than public school. 


Results 


Table 2 gives the results relevant to the 
predictions. The results tend to confirm 
the predictions: a significantly higher per- 
centage of the fathers of HA than LA Ss 
have middle-class occupations, and _signifi- 
cantly more of the HA than LA Ss are on 
a scholarship basis. For the other two factors, 
previous schooling and father’s education, the 
differences are not significant but are in the 
expected direction. The analysis also reveals 
that there are differences within the HA and 
LA groups. Significantly more of the HA, 
than HA, fathers have had a college educa- 
tion. Within the LA groups, LA, differs 
from LA, in that significantly more fathers 
have middle-class occupations. More LA, than 
LA, Ss have scholarships. The within-group 
analysis tends to support the conclusion 
that the distribution of anxiety scores does 
not represent a social class continuum. It 
could support the conclusion that although 
there is an over-all relationship between anx- 
iety and social class factors, the complexity 
of the relationship is of such a nature as to 
preclude a simple explanation. In any event, 
the fact that the predictions tended to be 
confirmed lends credence to the hypothesis 
that anxiety in the test situation is related to 
a strong need for intellectual achievement. 





the student's report of his father’s occupation would 
work against the prediction made. For example, if a 
student said only that his father was a lawyer, then 
he was put in category 6. It was quite clear (from 
other data) in sorne of these cases that merely saying 
“lawyer” did not reflect the father’s status in the 
business and professional world. Another example: if 
a student said his father was a physician he was also 
put in category 5, even though the father may be, so- 
cially speaking, in one of the upper classes. Category 
& probably contains more than a few cases who should 
be in category a. 





SoME CorRELATES 


It will be recalled that this hypothesis was 
the basis for the social class predictions. 


PsycHoMETRIC FINDINGS 
Theoretical Considerations 

The following theoretical framework is sum- 
marized from a more extended presentation in a 
previous article (1). 

Anxiety will be considered as a learned drive with 
the characteristics of a strong stimulus. When 
anxiety has been learned as a response to situations 
involving intellectual achievement (e.g., test situa- 
tions), two types of responses will tend to be evoked: 
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ment anxiety, this increase in anxiety drive 
will lead to poorer performance in individ- 
uals who have task-irrelevant anxiety re- 
sponses in their response repertory. For in- 
dividuals without such response tendencies, 
these stimulus elements will raise their general 
drive level and result in improved per- 
formance. (Such elements would be any 
reference that the individual is being judged, 
a statement of expected performance, etc.) 
3. When an individual with a strong anx- 
iety drive has repeatedly been rewarded for 


TABLE 2 


Sociat Crass DirFeRENcES BETWEEN THE HicH AND Low Anxiety Groups 








ANXIETY GROUP 





CATEGORY HA LA 


(N=141) (N=146) 


HAs 
(N=74) 


LA; 
(N66) 


HA; 
(N=67) 





Prior Education 
Private school 6s. 
Public school 34- 

Father’s Occupation * 


58.2 
41. 


- / Ld 
17.0% 27. 


64. 
8 


Fathers’ Education 

College 

No college 
Scholarship Recipients 

Yes 31.9% 23. 
No 68.1 76. 


68. 
31. 





CoMPal 


53-7 
40. 


25. 

65.7 
8. 

38.8% 


34.35 


65 





SuMMARY OF SIGNIFICANT DIFFERENCES BETWEEN Groups 





SON CLR. P 





Father’s Occupation 


Father’s Education 
Scholarship Recipients 


LA; vs. LAs 
HA vs. LA 
HA; vs. HAs 
HA vs. LA 


.66 <i 
+27 


+10 


.63 


LA: vs. LAs .20 





* See text for elaboration of the categories a, 6, and c. 
** One tail of the distribution used. 


(a) responses which are not task-relevant; self-cen- 
tered feelings of inadequacy, attempts at leaving the 
situation, etc., and (4) task-relevant responses which 
reduce the anxiety by leading to completion of the 
task. These are identical with other task-relevant 
responses evoked by the testing situation. 


The following hypotheses are relevant to 
the present study: 

1. Individuals with a strong anxiety drive 
(HA Ss) will perform poorer than those 
with a low anxiety drive when the task 
irrelevant responses interfere with adequate 
performance. 

2. When a stimulus situation contains ele- 
ments which specifically arouse test or achieve- 


making task-relevant responses in certain 
situations (e.g., success in course examina- 
tions, passing courses), these responses will 
become dominant over irrelevant responses 
in the task situation, and performance, owing 
to the summation of drives, will be superior 
to that of individuals with a low anxiety com- 
ponent. 


Source of Data and Predictions 

Upon entrance to Yale students are given 
a Mathematical Aptitude Test (MAT) and 
a Scholastic Aptitude Test (SAT). Upon 
the basis of these tests and other data, such 
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as high school grades, a predicted grade 
average (PGA) is computed for each student. 
In addition to these measures the student’s 
actual grade average (AGA) for the previous 
academic year was recorded. 


Prediction 1: The HA, Ss will have sig- 
nificantly lower scores on MAT and SAT 
than the LA, Ss. 

Prediction 2: The HA, Ss will have a 
significantly lower PGA than the LA, Ss. 
In a sense this prediction is contained in the 
first one because the predicted score is in 
part based on MAT and SAT. 

Prediction 3: The HA, Ss will have a 
significantly higher AGA than the LA, Ss. 
In a test situation, such as a course exam, 
the student is not faced with a novel situation. 
He knows the examiner (instructor), he 
has an idea of what the exam will contain, 
and there is time for previously learned 
anxiety-reducing task-relevant responses to 
become operative. These conditions are ab- 
sent in a group or individual intelligence 
test situation. In the case of the course exam, 
the HA, S’s strong achievement and anxiety 
drives should result in better performance 
than that of the LA, S. It will be recalled 
that in the first study (1), the HA, Ss per- 
formed better than the LA, Ss when nothing 
was done to raise anxiety level after it had 
been presumably reduced from its original 
level (neutral condition). 

Assuming that the anxiety 
a behavioral contin- 


Prediction 4: 
distribution represents 
uum, one would make predictions similar to 
those above for a HA, vs. LA, comparison. 
Making the same assump- 
tion, one might predict that HA, Ss would 
differ from HA, Ss in that they would have 
lower MAT and SAT scores, a lower PGA, 
but a higher AGA. In comparing LA, and 
LA, one might expect the former to have 
higher MAT and SAT scores, a higher PGA, 
but a lower AGA. 

Prediction 6: Assuming that different anx- 
iety scores (for example, an HA, vs. an HA, 
S) do not necessarily reflect the true anxiety 


Prediction 5: 


score which is obscured by differences in 
readiness to admit strength of anxiety in the 
questionnaire, one would not expect the sig- 
nificant differences 


described in 1 and 2 
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above.* An HA, group may, for example, 
contain individuals who, if a true anxiety 
score could be obtained, would be in the HA, 
group. Thus, to compare an HA, to an 
LA, group (where the same problem arises), 
or an HA, to an HA, group, would not be 
expected to give significant differences. 

Table 3 presents the data relevant to the 
predictions. It will be seen that predictions 
1, 2, 3 for the HA, and LA, groups tend to 
be borne out: the HA, group obtained sig- 
nificantly lower MAT and SAT scores, a 
significantly lower PGA, and a higher AGA 
which approaches significance. 

In the comparison (prediction 4) involving 
HA, vs. LAs, there are no significant dif- 
ferences or differences approaching signifi- 
cance. In the comparison (prediction 5) in- 
volving HA, vs. HA, there are no significant 
differences; the only result approaching sig- 
nificance, in the expected direction, is the 
higher AGA of the HA, group. 

In the comparison (prediction 5) involving 
LA, and LAg, the results are all in the 
expected direction: the LA, group obtained 
significantly higher MAT and SAT scores, 
a significantly higher PGA, and lower AGA 
which is not significant but in the expected 
direction. 

It would appear from the results relevant 
to predictions 4 and 5 that the low end of 
the anxiety distribution approximates a be- 
havioral continuum while the high end does 
not. While at the low end an increase in 
score produces certain predicted relationships, 
at the high end a decrease in score does not 
result in the predicted relationships. In the 
light of this it was thought desirable to see 
how the various measures correlated with 
each other in the several groups. Table 4 
shows the correlations cf MAT vs. SAT and 
PGA vs. AGA within the various groups, 
and the significance of the differences among 
the correlations. The major results appear 
to be: (a) the correlations for the HA, 
group are always higher, in one instance 
significantly and in the other near signifi- 
cantly, than for the HA, group; (4) the 


*In the section on social class findings, this problem 
was not considered particularly relevant since there 
one was dealing only indirectly with psychological 
processes. HA: and HAg (or LA: and LAs) Ss may 
come from similar backgrounds although their reac- 
tion patterns to anxiety may differ markedly. 
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TABLE 3 





LA; 





Scholastic 
(SAT) 
Mathematic Aput 
(MAT) 
Actual Grad 
(AGA) 
Predicted Grade 


(PGA) 


Aptitude Test 

577-9 
ude Test 

602.8 

Average 

78. 
Aver Age 

75.82 








AND NEAR-SIGNIFICANT COMPARISONS 
COMPARISON t 


SIGNIFICANT 





Scholastic Aptitude Test -44 
-49 


HA vs. LA 2 
HA: vs. LA: 2 
HAs vs. LAs I 
LA; vs. LAs 1.39 
HA vs. LA 2.21 
HA, vs. LAs 2.68 

I 

I 

I 

I 

I 

I 


.00 
Mathematic Aptitude Test 


LA; vs. LAs 
HA, vs. LAy 
HA; vs. HAs 
HA vs. LA 

HA; vs. LA; 
LA; vs. LAs 


Actual Grade’ Average 


Predicted Grade Average 





* One tail of the distribution of ¢ used. 


correlations for the LA, group are always 
lower, in one instance significantly but not 
in the other, than the LA, group. If, as 
is usually the case, extreme scores on a test 
are less reliable than those away from the 
extremes, then the results for the LA, and 
LA, groups are what one would expect. Such 


an explanation obviously does not explain 
why the HA, Ss are so much more consistent 
than the HA, Ss. These contrasting findings 
lend more support to the hypothesis that the 
high end of the anxiety score distribution 
does not represent a continuum of behavioral 
manifestation. 


TABLE 4 


CorRELATIONAL FINDINGS ON PsYCHOMETRIC AND ScHoLasTic Data 








en scholastic and mathematic aptitude test scores for the high and low anxiety groups. 


HA LA 


___HaAs LA LAs 








Pearson r -037 . 384 


+220 +492 


SIGNIFICANT DIFFERENCES BETWEEN Groups 





CoMPARISON 


t P 





HA vs. LA 

HAg vs. LAs 
HA, vs. HAg 
LA: vs. LAs 


-OO! 
-OOo! 
-31 -09 
.85 +03 





B. Correlation between achieved and predicted 
HA LA 


grade average for the high and low anxiety groups. 
HA; 


HAs LA: LAs 





Pearson r -531 -455 


-657 


+397 +447 -495 





SIGNIFICANT 


DirFerRENcES BETWEEN Groups 





CoM PARISON 


t P 





HA; vs. LA; 
HA; vs. HAs 


+02 
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ANXIETY AND Group 
INTELLIGENCE TEst Scores 

In one of the classes (N==88) it was possi- 
ble to give the Henmon-Nelson Test of 
Mental Ability, college level. A 20-minute 
time limit was utilized. The correlation 
(Fig. 2) between anxiety and intelligence 
test score was +.209, which was significant 
at the .05 level. The most interesting feature 
of the scatter diagram is the fact that the 
6 sudents with the highest intelligence test 
scores invariably have a low anxiety score. 
However, it is not true that those with the 
lowest intelligence test scores have the highest 


anxiety scores. 








Fic. 2. ScATTERGRAM SHOWING RELATION BETWEEN 
Scores AND HenMon-NELSON 
Test Scores 


ANXIETY 


ANXIETY SCORE AND 


Reactions To FRUsTRATION 


The questionnaire on “habitual reactions 
to frustration” is one which has been used 
by Waterhouse and Child (4). Aside from 
a total “interference” score, the questionnaire 
can also be scored so that the different facets 
of reaction to frustration can be categorized 
(4). Table 5 shows the correlations between 
the anxiety score and the various scores de- 
rived from the frustration questionnaire. In 
interpreting the data in Table 5 it should be 
kept in mind that the “over-all interference” 
score is probably the most reliable, since it 
is the measure that is based on the most 
questionnaire items. The correlations are 
based on 62 Ss who happened to be used in 
both this and the Waterhouse-Child study. 
The data rather clearly demonstrate a rela- 
tion between test anxiety and degree of inter- 


fering responses to frustration. However, 


the data are not revealing as to the causal 
connection between the two. 


Discussion 


One of the main problems arising from 
this study concerns the high end of the 
test-anxiety distribution of scores. Specifi- 
cally, the question is why the predicted re- 
lationships do not obtain in a HA, vs. HA, 
comparison, but do obtain in a LA, vs. LA, 
comparison. One explanation involves the 
assumption that HA, and HA, Ss differ 
not in strength of their anxiety responses 
but in the ways in which they defend them- 
selves against recognition and expression of 
such responses. One might expect, for ex- 
ample, that while the HA, S is aware of 
his anxiety responses in the test situation— 
an awareness of self which interferes with 
appropriate awareness of the stimulus ma- 
terials—the HA, S experiences less of such 
self-awareness but makes other types of re- 
sponses which also have an interfering effect. 
A clinical example might make this line 
of reasoning clearer. Two patients may both 
have experienced strong hostile feelings to 
their therapists shortly before a_ particular 
therapeutic hour. Once the hour has begun 
patient A finds his “mind is a blank” and he 
has little to talk about. Patient B continues 
to be aware of his hostile feelings but strives 
to avoid expressing them. In both cases 
the effect of the different defenses is inter- 
ference with functioning. If after the thera- 
peutic hour one were to give the two patients 


a questionnaire designed to measure the 
strength of their hostile feelings towards their 
therapist, one might find that patient B gets 
a higher score than patient A. Their “true” 
scores might not be different. These examples 


TABLE 5 


CorrELaTions BETWEEN Test ANXIETY AND 
“Hasituat REAcTIONS TO FRUSTRATION” 


ScorE 





Anxiety vs 
Over-all interference 
Preoccupation and rehearsal 
Counteraction 

Aggression 
Pessimism 
Self-aggression 
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are given merely to suggest that the differ- 
ences between HA, and HA, Ss may be a 
difference in defense mechanisms. Since both 
HA, and HA, Ss are presumably in a state 
of strong drive, one would expect differences 
in learned defense mechanisms to become 
manifest. In the case of low drive Ss (LA, 
and LA.) defense mechanisms are not as 
likely to be evoked. The possibility can- 
not be overlooked, however, that in some 
cases a low anxiety score might also be re- 
lated to the need for defense. 

In general terms, there appear to be two 
related problems: (@) the relationship be- 
tween strength of drive and adequacy of 
defense, and (4) the relationship between 
strength of drive and choice of defense. By 
approaching the problem of test anxiety 
from the standpoint of the nature and 
strength of defense mechanisms, it may be 
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possible to explain individual differences more 

adequately than is now the case. 

One other important implication of the 
present study can be given as a quotation 
from an earlier one (1): “It is questionable 
whether intelligence test scores adequately 
describe the underlying abilities of individuals 
with a high anxiety drive in the testing 
situation.” 
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ucn human behavior is inextricably 

associated with verbal processes. 

Adequate theoretical accounts of 
such behavior cannot be expected until the 
role of language is better understood. This 
understanding is in itself a long-range goal, 
to be gained by careful analysis of the com- 
plexities of language and the assiduous gath- 
ering of basic empirical data. The prelimi- 
nary study reported here was undertaken as 
a step in this direction. 

Investigations of verbal behavior have fre- 
quently encountered the intricate problem of 
context. This problem, of course, may be 
analyzed into a large number of factors. 
One aspect which has received attention is the 
tendency for words to appear in meaning- 
fully related clusters even in the absence of 
the usual syntactically structured communica- 
tion situation. 

Bousfield, for example, has noted clustering 
f words during the production of discrete 
or individual words (1, 2) and has dealt 
with this phenomenon in experimental set- 
tings. In a recent experiment (3) he showed 
that a randomized list of words drawn from 


lo 
‘> 


four categories (animals, vegetables, names, 
professions) tends to be recalled in clusters 


of related (i.c., same category) words. He 
postulates a “relatedness increment” of habit 
strength to account for the clustering. 

The writers have also observed tendencies 
for discretely produced words to appear in 
groups or clusters. This typically occurs 
even when experimental instructions require 
the subject to say individual words and 
avoid sentences, phrases or counting. 

Among the possible determinants of cius- 
tering as observed in these experiments is 
simple word-association strength. In the pres- 
ent study an effort was made to determine 
the extent to which clustering during recall 
is a function of such associations. 

Word-association techniques have a long 
history of use in psychology. Typically a 
stimulus word is provided by the experi- 


I 

1 This study is part of a larger series of studies of 
verbal behavior being conducted at the University of 
Minnesota. The series is being sponsored by the Office 
of Naval Research (Contract No. N8onr 66216) under 
its policy of encouraging basic research. 
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menter (E) and the subject (S) is asked 
to respond with the first word that occurs to 
him. Commonality of response has been 
so pronounced for certain stimulus words 
that it has been possible to develop cultural 
norms from which one can predict individual 
responses with fair accuracy (6, 7, 8). The 
frequency with which a particular stimulus 
word is followed by a given response word 
is considered to be an index of the associative 
strength of the paired words within the 
norm group. The predictive utility of this 
index when associations are elicited in some- 
what different situations is not well known. 
There is some evidence that objects or pic- 
tures substituted for the stimulus words will 
evoke responses similar to those made to the 
stimulus words in the test (5). However, 
the generality of these associative phenomena 
when the stimulus is produced by S himself 
rather than by E has not been systematically 
investigated. 

During the recall of discrete words, S 
might be expected to respond to his own 
verbal production as if these stimuli origi- 
nated from an external source. It might be 
hypothesized that the associative phenomena 
noted in the classical word-association test 
would occur during the recall of word lists 
containing associatively related words. The 
clustering of words during recall thus would 
be expected to be, at least partially, a function 
of culturally determined associative strengths. 

Four hypotheses emerge from this line of 
thinking. First, associative pairs should occur 
more frequently during recall than nonsys- 
tematically selected word pairs. This amounts 
to saying that word pairs known to occur 
with high frequency in word association 
tests will appear more often than randomly 
determined pairs. 

Second, reverse associations should occur 
more frequently than nonsystematically de- 
termined pairs. That is, the response word 
of a strong association would be expected 
through its common occurrence (perhaps in 
varied sequence) with its stimulus word in 
everyday language to have some tendency 
to elicit its own stimulus word. This should 
make the frequency of response-stimulus 
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pairing somewhat greater than random pair- 


ings. 

Third, forward associative pairs should 
occur more frequently than reverse associative 
pairs. Stimulus words in many association 
tests are selected because they show high 
frequencies of association with 1 given re- 
sponse. The presence of he sumulus-re- 
sponse order in a word list of’. “ aplies that 
associations in that direction are greater than 
in the reverse direction. Little systematic 
research has been done with the frequencies 
of reversed associations since occasions for 
measuring such associations have seldom 
arisen. It would be expected that on the 
average forward associations would be at 
greater strength than reverse associations. 

Fourth, persons who show high common- 
ality of response (6) in the typical free 
association situation should demonstrate as- 
sociative clustering in recall to a greater ex- 
tent than persons who show low common- 
ality. A high commonality index for an 
individual indicates the presence of many 
popular associative bonds. This should lead 
to a stronger tendency for such an individual 
to recall appropriate pairs of words. 

The first three of these hypotheses were 
tested in the framework of the present ex- 
periment. 


PROCEDURE 


The procedure in this study was similar to Bous- 
field’s (3) with the exception that the word list used 
was made up of randomly presented stimulus and 
response words from the Kent-Rosanoff association 
test. 

The pairs of stimulus and response words were 
selected from the list primarily on the basis of fre- 
quency. It was necessary that neither the stimulus 
word nor the response word of any pair be the same 
as the stimulus or response of any other pair. It 
was felt that a list of approximately 50 words would 
be neither too difficult nor too easy for the recall 
task. The 40 most frequent pairs in Schellenberg’s 
(7) normative data for college students were 
selected, and subsequent eliminations on the basis of 
overlapping stimulus and response words reduced 
the list to 24 pairs. The lowest frequency pair oc- 
curred 288 times in Schellenberg’s 925 Ss. In those 
cases where the same pairs appeared in the test in 
reverse order (e.g., black-white, white-black) the 
pair with the highest frequency was the pair an- 
alyzed as a forward association in the experiment. 
The words were randomized and the list was 
checked to avoid the contiguous appearance of any 
pair of words in the forward stimulus-response 
order. 


Table 1 lists the pairs used. The number follow- 
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ing each word indicates its position in the random- 
ized list. 

The word list was presented to two groups of Ss. 
Group 1 (N=39) was an introductory class in lab- 
oratory psychology. Group 2 (N=62) was an ad- 
vanced class in the psychology of individual differ- 
ences. The two-classes were chosen for independent 
replication. 

The Ss were told that they were participating in 
a formal research project. After they were provided 
with sheets of composition paper, they were given 
the following instructions: 


I am going to read a rather long list of words. I 


want you to listen carefully and when I am through 
we will see how many you can remember. 


TABLE 1 


IN EXPERIMENT * 


RESPONSE RESPONSE 


Table (37) Chair (17) 
Man (30) Woman. (18) 
Mountain (1) Hill (21) 
Black (36) White (12) 
Mutton (44) Sheep (16) 
Slow (46) Fast (6) 
Rough (25) Smooth (40) 
Carpet (5) Rug (47) 


STIMULUS 





Eagle (24) Bird (11) 
Lamp (33) Light (32) 
Cottage (22) House (4) 
Ocean (45) Water (7) 
Long (29) Short (3) 
Square (31) Round (38) 
Butter (20) Bread (8) 
Lion (13) Animal (26) 
High (14) Low ** (27) Bed (35) Sleep (43) 
Sour (41) Sweet (34) Tobacco (2) Smoke (9) 
Cabbage (23) Vegetable (28) Scissors (42) Cut (10) 
Hard (19) Soft (48) Blossom (15) Flower (39) 


* Number in parentheses refers to position of the 
word in the randomized list. 

**In class 1 this was misread as lion so that in effect 
this class had only 23 pairs from the Kent-Rosanoff list. 





The order in which the words are given or recalled 
is not important, but the number you will be able to 
recall is. That is, you can remember the words in 
any order as long as you remember as many as possible. 
Since the list is long, you are not expected to remember 
all the words. Just do the best you can. I will ex- 
plain the exact procedure for the recall test as soon 
as I have finished the list. Are there any questions? 

Listen carefully and remember all the words you 
can: here is the list. (Here the list was read at the 
rate of approximately one word per second.) That 
ends the list. Now take the piece of lined paper and 
a pencil. Do not write anything. When I give you 
the signal start writing as many of the words as you 
can remember. Write down the left-hand column 
of the paper. Put one word to a line, and if you 
complete the left hand column, start down the right- 
hand side with a second column. At regular intervals 
I will ask you to draw a line beneath the last word 
you have written. Do this clearly and then go right 
on down the page. Make a clear separate line every 
time I say that. Write down the words in any order 
that they occur to you. 

Any questions? Ready, go! 


After working five minutes at the recail task, Ss 
were asked to raise their hands if they had produced 
any words in the preceding 30 seconds. They then 
resumed the recall task immediately. This question 
was repeated every 30 seconds until no hands were 
raised. In the first group this occurred at seven and 
one-half minutes. Analysis of the data for both 
groups was accordingly based on this time. 
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In order to create a serious experimental atmos- 
phere, the proceedings of the entire session were 
recorded and this fact was brought to the attention 
of the Ss.* 

The results of the recall test were analyzed for 
(a) the number of responses, (4) the number of 
forward associations, (c) the number of reverse asso- 
ciations, and (d) the number of arbitrary, non- 
Systematic pairs. 

Each occurrence of a stimulus word which was 
followed immediately by its response word was 
called a forward association. Each response word 
which was followed immediately by its stimulus 
word called a association. Arbitrary 
pairs defined as instances in which a 
wed immediately, not by its 


was reverse 


were those 
stimulus word was foll 


word, but by the response word of the 


own response 
pair succeeding it in Table 1 {e.g., table-woman, 
There seemed no reason to expect 
systemati words so selected. 
It was felt that such arbitrary pairs would provide 
of chance pairings. 


man-hill, etc.). 


associations between 


an estimate 


REsuULTs 

The major results relative to the hypotheses 
to be tested are presented in Tables 2 and 3. 

The first hypothesis was that forward 
associative pairs would occur more frequently 
during recall than would arbitrarily selected 
word pairs. This proved to be the case. For 
class 1, the mean number of associative pairs 
was 3.41, while the mean number of arbitrary 
pairs was 0.26. Using a ¢ test for related 
measures this difference proved to be signifi- 
cant far beyond the .oor level of confidence. 
The corresponding differences for class 2 
reached a similar high level of confidence. 
It is obvious that in this situation associative 
strength is a factor influencing the clustering 
of words. 

The second hypothesis was that reverse 
associative pairs would occur more frequently 
than arbitrary pairs. Here again appropriate 
t tests proved the mean differences to be sig- 
nificant far beyond the .oor level in both of 
the classes tested. 

The third hypothesis was that forward 
associations would occur more frequently 


TABLE 2 


MEAN AND STANDARD DeviATION oF RELEVANT 
VARIABLES IN CLASS I 


Mean 


VARIABLE 





Forward Associations 3.41 
Reverse Ass 
Arbitrary Associations 0.2 
Total Words Recalled 24.64 


ociations 2.55 





2 Record 


ec ings on Audograph disks are available on loan 
from the 


writers. 


James J. Jenkins aNnD Wattace A. Russet 


TABLE 3 
MEAN AND STANDARD DEVIATION OF 


a 


RELEVANT 





Forward Associations 
Reverse Associations 

Arbitrary Associations 
Total Words Recalled 





than reverse associations. In class 1 
difference was in the expected direction but 
was of only borderline significance (t—=1.81, 
05<p<.10). In class 2, however, the cor- 
responding difference yielded a ¢ of 3.37 
which, of course, is significant beyond the .o1 
level. Taken together, it seems reasonable 
to conclude that the third hypothesis is also 
confirmed. 

The importance of associative strengths is 
clearly evident when the combined forward 
and reverse associations are considered. In 
class 1 there are 6.26 such associations on the 
average and 6.08 in class 2. Since each asso- 
ciation consists of two words, it may be seen 
that these associations account for slightly 
more than 50 per cent of all words recalled. 

During the analysis of the data for class 1, 
it became apparent that there were sex differ- 
ences in the frequency of forward associa- 
tions. This difference in frequency was sig- 
nificant at the .o2 level, with females showing 
the greater number of associations. This 
was associated with a higher absolute level 
of total word production of females, although 
differences in total production between the 
sexes did not reach a high level of signifi- 
cance (p==.10). In class 2, however, differ- 
ences in total production reached significance 
at the .or level while the differences in 
number of associations showed only a slight 
predominance in favor of the females 
(.10<p<.20). Taken together, the results 
from the two classes would seem to warrant 
the conclusion that the females recalled sig- 
nificantly more words and produced signifi- 
cantly more forward associations than the 
males. 

It is, of course, possible that the sex differ- 
ences in forward associations are a direct 
result of the fact that the females recalled 
more words and thus had greater opportunity 
to produce associations. However, it may be 
observed that in class 2, where the difference 
in total production is greater, the difference 
in associations is less significant than in 
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class 1. This seems to imply that the rela- 
tionship is not a perfect one. In order to 
determine whether or not differences in asso- 
ciations would persist when there were no 
differences in total production, an analysis 
of sex differences was made with the number 
of words recalled held constant at the level 
of the S recalling the fewest words (this 
amounted to 15 words in class 1 and 10 words 
in class 2). In class 1 this procedure almost 
eliminated any differences in associations 
between the sexes (.50<p<.60), but in 
class 2 the females still showed a significantly 
greater number of associations (p=.o1). 
While combined probabilities would convinc- 
ingly support the conclusion that females 
produce more associations even when total 
word production is held constant, it would 
seem safest, in the absence of replication, to 
state this only tentatively. 


Discussion 
While the sex differences in associative 
clustering were not anticipated, their appear- 
ance is perhaps explicable. Schellenberg (7) 
has shown that for the Kent-Rosanoff word 
list female college students show a signifi- 


cantly greater commonality of response than 


do male students. If our fourth hypothesis 
(not tested in this experiment) is correct, the 
sex differences found here would be expected. 
This hypothesis states that persons who show 
high commonality of response should demon- 
strate associative clustering in recall to a 
greater extent than persons who show low 
commonality. Studies are now under way to 
examine in detail the relationships between 
commonality of response and associative clus- 
tering in recall. 

While it should be possible to manipulate 
average association strengths by using groups 
of students with different commonality in- 
dices, this is not the only way in which asso- 
ciative strength might be varied. Another 
approach would involve the use of special 
word lists in which the response words were 
those known to occur at relatively low fre- 
quencies. For example, a list using the same 
stimulus words as those used here but sub- 
stituting secondary or tertiary response words 
for the primary response words should 
accomplish this end. With the reduction in 
average associative strength, the amount of 
associative clustering should be reduced. 
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Of course it is recognized that associative 
clustering is but one of many factors deter- 
mining the order of recall. In our protocols 
the effects of serial order, clang associations, 
etc., were obvious. Nevertheless, in this 
situation it can be seen that once the S has 
recalled a stimulus word, the probability that 
a culturally frequent response will follow is 
far above chance. The fact that an S so 
responds to his own verbal production is of 
clear relevance to current formulations (4) 
which stress the importance of response-pro- 
duced stimulation in behavior determination. 


SUMMARY 

Two psychology classes recalled a word list 
comprised of stimulus and response pairs 
from the Kent-Rosanoff word list. The 
words were presented in random order. Both 
groups showed a highly significant tendency 
to recall the Kent-Rosanoff pairs together and 
in the stimulus-response sequence. Reversed 
associations (recall in the response-stimulus 
sequence) occurred significantly more than 
chance pairings but significantly less than the 
forward sequence. Females recalled more 
words and produced more forward associ- 
ations than males. In one class females pro- 
duced more forward associations even when 
total production was held constant. 

The results are interpreted as demonstrat- 
ing that associative strength is a factor in 
word clustering during recall. 
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JUDGMENTAL PHENOMENA AND SCALES OF ATTITUDE 


MEASUREMENT: 


ITEM DISPLACEMENT IN 


THURSTONE SCALES’ 
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N his influential study The Measurement 

of Attitude, Thurstone states as a major 

requirement for attitude scales that the 
scale values of statements be independent of 
the position held by the judges who do the 
initial categorization: 


If the scale is to be regarded as valid, the scale 
values of the statements should not be affected by 
the opinions of the people who help to construct it. 
This may turn out to be a severe test in practice, but 
the scaling method must stand such a test before 
e accepted as being more than a description 
people who construct the scale. At 
rate, to the extent that the present method of scale 
construction is affected by the opinions of the readers 
who help to sort out the original statements into a 
scale, to that extent the validity or universality of 
the scale may be challenged (19, p. 92). 


it can | 


or the anv 


The first research study of the Thurstone 
assumption was carried out by Hinckley (8) 
on attitudes toward Negroes. Opinion state- 
ments were categorized by two groups of 
white judges, one with anti-Negro attitudes 
and the other with pro-Negro attitudes, and 


by a group of Negro judges. Scale values 


1 The present study, the first of several on the topic 
of judgmental phenomena and scales of attitude measure- 
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were determined by the method of equal- 
appearing intervals. Hinckley found that the 
average scale values for the two white groups 
were highly correlated (r—.98) despite their 
difference in attitude on the issue and that the 
scale values for the anti-Negro white judges 
were also closely correlated with those for the 
Negro judges (r=.93). On the basis of these 
Hinckley concluded that 
which we have constructed for measuring 
attitude toward the social position of the 
Negro is not influenced in its measuring 
function by the attitudes of the subjects used 
in the construction” (8, p. 293). Substantially 
similar results were obtained in a later study 
by Pintner and Forlano (14), who used the 
Attitude toward Patriotism scales which had 
also been derived by the method of equal- 
appearing intervals. 

That the attitudes and opinions of the 
judges have no effect on the placement of 
items is in sharp conflict with the results of 
studies in the fields of perception and judg- 
ment. These studies indicate that judgments 
are greatly influenced by motivational and 
attitudinal factors operative at the time. Thus 
in the studies by Asch, Block, and Hertz- 
man (2) where subjects judged photo- 
graphs, political figures, and 
slogans along such dimensions as intelligence, 
honesty, and social usefulness, judgments 
were made “in accordance with an under- 
lying attitude of acceptance or rejection.” 
Other studies indicate that category intervals 
are directly affected by attitudinal factors. As 
long ago as 1914 Fernberger (7) showed that 
changes in attitude bring about changes in 
the range of judgments of equality, hence in 
the size of category intervals. A series of 
studies by Volkmann (20), Rogers (16), and 
McGarvey (13) showed that category intervals 


results “the scale 


professions, 
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are caused to expand or shrink by the experi- 
mental introduction of anchoring agents 
beyond or within the scale and that “anchor- 
ing can be achieved by appropriate verbal 
instructions, without the use of anchoring 
stimuli” (21, p. 287). 

Displacement of judgments along a scale 
seems a general phenomenon when the 
stimuli carry strong social and personal value. 
For example, Cartwright found that the S’s 
estimation of equivalence of statements was 
affected by his own position. “The size of 
the range of equivalence related to a ‘radical’ 
and to a ‘conservative’ differed, depending 
upon S’s own political attitudes” (3, p. 195) 
“For the radical S, the judgments of ‘radical’ 
extend 7.5 units while the judgments of 
‘conservative’ extend 1o units. For the con- 
servative S, on the other hand, the judg- 
ments of ‘radical’ extend 8.5 units, while the 
judgments of ‘conservative’ extend only 5.2 
units” (3, p. 189). 

In studying the response of Negro youth 
to the social value placed on light skin color, 
Johnson (10) found a tendency for self-rat- 
ings to be a shade or more lighter than they 
appear to the tester. This was shown even 
more definitively in the study by Marks (12) 
who matched each Negro S’s skin color with 
an objective hue. The tendency was to dis- 
place one’s own coloring as much as objective 
fact allowed in the direction of the desired 
norm of light brown. A dark individual, 
rating himself close to average, would dis- 
place individuals lighter than himself to 
lighter than average categories. In contrast, 
a light individual would judge the majority 
of the group as dark. While the relative 
position of each S is kept the same by differ- 
ent raters, the absolute position varies from 
rater to rater. 

It is interesting to note that in Cartwright’s 
study the Ss were more discriminating in 
accepting items at their own end of the scale, 
and lumped together items at the extreme 
opposite to their own stand, while in the 
study of Marks the displacements of items 
(skin color) were in the direction of the 
desired skin color. It is apparent that th 
direction of item displacement will depend 
upon both the stimulus material and the S’s 
motivation in relation to it. 

More recently Hinckley himself, in col- 


laboration with Rethlingshafer, added further 
substantiating evidence of the effect of the 
judge’s own attitude upon his judgments. 
Starting with the general assumption that 
“men judging heights of men on a scale of 
short to tall would be influenced in their 
scale values by their own heights” (9, p. 257), 
these investigators found that “the judgment 
of the average height of all men is influenced 
by the height of the man making the judg- 
ment. The ‘meaning’ of the social value 
terms of ‘short’ and ‘tall’ is in part determined 
by the height of the judge. The ‘egocentric’ 
influence is also controlled by the objective 
facts, particularly in judging the ‘extreme 
heights” (9, p. 262). 

Thus we have the paradox that some lead- 
ing texts in social psychology say in the 
chapter on perception and judgment that 
judgments are greatly affected by the indi- 
vidual’s attitudes and motives, while in the 
chapter on scaling methods they state that 
judgments of the meaning of items are 
unaffected by the position of the judges who 
do the sorting. Analysis of prior attitude 
scale studies suggests a possible explanation 
for the discrepancy: the utilization of too 
narrow a range of individuals, with the con- 
sequent elimination of the extreme Ss who 
would show variation and displacement of 
judgments most clearly. In the Pintner and 
Forlano study, the question may be raised as 
to whether intense personal involvement was 
achieved on the issue of patriotism during the 
thirties, and the authors report that “No 
group could be called unpatriotic” (14, p. 41). 
In the Hinckley study (8), on the other hand, 
considerable personal involvement in the issue 
would be expected, particularly on the part 
of the Negro judges. But the report suggests 
that here again the range may have been 
unintentionally narrowed by a procedure he 
intended only to eliminate “careless” subjects: 


One tendency which revealed itself in the sorting 
of the subjects was the bunching of statements in 
one or more piles to the apparent detriment of the 
other piles. This phenomenon of bunching at the 
extremes was noticed in the case of certain of the 
white subjects, but was especially noticeable in the 
Negro subjects. Since the 114 statements are dis- 
tributed with fair uniformity over the entire scale, 
marked bunching is a sign of careless sorting. If 
more than a fourth of the statements are assigned to 
any one pile, it will leave less than three-fourths to 
distribute over the remaining ten piles. Further- 
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more, the individual who sorts the statements in 
this fashion often ignores some of the piles com- 
pletely. On the assumption that this bunching was 


j } 


ti carelessness every 


due to poor discrimination and 
case having 30 more statements in any one file 
sutomatically eliminated from consideration, 
re not re orded (8, p- 283).? 


was 


resuuls we 


From the studies of judgment cited above 
it would be predicted that this piling up of 
statements in certain categories by some of 
the white Ss and, in a more noticeable way, 
by the Negro Ss may be due to the effect of 
the Ss’ strong attitudes on the topic. It may 
be that by eliminating these Ss the investi- 
gator eliminated, at the same time, those who 
had the most accentuated attitudes on the 
issue. It is interesting, in connection with this 
hypothesis, that even after all Ss placing 30 
or more statements in any category were 
eliminated, there was still “a slight tendency 
for Group I, which is prejudiced against the 
Negro. te judge a ¢ statement to be more 
favorable to the Negro than Group II [favor- 
able to the Negro} judged it” (8, p. 293). 


PROBLEM 

The previous studies of the Thurstone scale 
do not appear to the writers to have fully 
settled the issue of whether or not scale values 
are affected by the attitude of the judges, 
since there does not appear to be sufficient 
evidence that adequate representation was 
made of Ss at the extremes, with intense in- 
volvement in the issues being judged. We 
were, therefore, interested in reinvestigating 
the problem under conditions which would 
provide the maximum opportunity for the 
operation of the types of factors which have 
been found important in the field of percep- 
tion and judgment. It is hoped that such a 
study will be a contribution to a rapproche- 
ment between the areas of judgment and of 
attitude measurement. 

Accordingly, the Hinckley experiment was 
duplicated using attitudes toward Negroes, 
but with means of insuring that the Negro 
and pro-Negro white judges were not selected 
in any way which might exclude those with 
the strongest attitudes on the issue, as we 
thought possible with the Hinckley procedure 
of eliminating individuals who placed 30 or 
more items in any one category. Moreover, 
checks were incorporated to evaluate his in- 


2 Italics ours. 


Cart I. Hovitanp AND Muzarer SHERIF 


ference that such individuals were “careless.” 

On the assumption that the set of 114 state- 
ments used by Hinckley represents a wide 
range of positions, with some clearly pro and 
others clearly con, and that a large number 
are of a neutral character, a favorable situ- 
ation exists for studying how internal factors 
such as motives and attitudes affect the place- 
ment of the rather unstructured and am- 
biguous middle-position statements. The 
literature on judgment and perception cited 
earlier clearly suggests the general hypothesis 
that the position of a judge on an issue in 
which he has strong personal involvement 
will constitute an “anchor” for his judgments, 
whether or not his attitude is specifically 
called for in the instructions. The specific 
hypotheses for the present study may be 
stated in summary form as follows: * 

I. Judges with extremely pro or con atti- 
tudes will show a tendency to concentrate 
their placement of items into a small number 
of categories. 

II. Judges with an extreme position and 
strong personal involvement will be highly 
discriminating in accepting items at their 
own end of the scale. They will correspond- 
ingly display a strong tendency to lump 
together statements at the end of the scale 
which they reject. The former tendency can 
be described as a raised threshold of accept- 
ance and the latter as a lowered threshold for 
rejection. 

III. A greater degree of displacement will 
occur for the “neutral” items and a smaller 
degree for the sharply defined pro and con 
statements at the extremes. 

In less technical terminology these hypothe- 
ses state that individuals with strong personal 
involvement will tend to see issues pretty 
much “in all black or white” rather than 
with fine distinctions, and that statements 
even mildly critical of their position will be 

8 These hypotheses are formulated in relation to the 
placement of items in categories along a pre- 
scribed by the experimenter. Such scales and categories 
may be referred to as “imposed” scales and “imposed” 

zories. We have also extended these hypotheses 
by allowing each S to define his own scale by choosing 
the mumber of categories he personally considered 
necessary to distinguish the different positions in the 
issue. Results using these methods, obtained in con- 
junction with the present experiment, will be reported 
im a separate paper. 


Studies in addition to those already cited which bear 
on our hypotheses include 1, 4, 11, 15 and 17. 


scale 


categ 
a 
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judged to be more hostile by them than by 
more neutral individuals. 


METHOD 
Procedure 

The problem presented above requires a design 
satisfying the following: (@) a topic which has 
strong personal involvement for Ss chosen to rep- 
resent the pro or con stand on the issue, (4) clear 
differentiation of Ss with respect to their position 
on the issue, (c) administration of the judging task 
without giving any hints that an attitude study is 
involved, and (d) checks to determine whether 
heavy concentration of statements in any category 
is due to “carelessness.” For these reasons, the 
Attitude Toward Negro scale was considered to 
be the most appropriate topic, and through the 
kindness of Professor Hinckley we were able to 
secure the original 114 staternents he used concern- 
ing the social position of Negroes. These statements 
had been selected to include the entire range from 
very favorable to very unfavorable on the issue and 
to be distributed in a fairly representative way over 
the range. 

The prime consideration in each step of the 
procedure was that all means be taken to insure 
careful and uniform administration to relatively 
small groups of Ss in each session and to be certain 
that Ss understood the instructions at each point. 
In every case the experiment was administered by 
graduate assistants who had been thoroughly briefed 
concerning the procedure, who had themselves car- 
ried out the instructions under close supervision, 
and who had in most cases practiced the procedure 
on a trial group. A special point was made to 
administrators who would appear to Ss to be 
“one of them.” The groups were kept small in 
number whenever possible, in order that adminis- 
trators could easily but unobtrusively observe Ss 
while instructions were being carried out. On the 
whole the sorting was administered in large rooms, 
chiefly in laboratory space, in order to give each S$ 
ample room to spread out the statements before him. 

Categorization of items. The first task for the 
Ss consisted of sorting the statements into piles. 
Each of the 114 statements was mimeographed on 
a 3X5 in. card. To facilitate later tabulation, 
an arbitrary code number unrelated to the content 
of the item was assigned to each statement - 
stamped on the back of the card. No S asked ; 
question concerning any possible connection oq 
tween the statement and the code number. As a 
further poeeeeen, the cards were shuffled before- 
hand so that S would not be able to get an idea 
ibout the exact number of statements. It was 
thought that the knowledge of the exact number 
of statements might tempt him to divide the total 
number of cards by the number of categories to 
give him a fixed guide for the number to be as- 
signed to each category. 

Deliberately, no mention of attitude measure- 
ment was introduced in the instructions until the 
sorting phase was completed. It was only after 
the sorting was completed by S that instructions 


select 


aiming to tap his attitude on the issue were intro- 
duced. To insure this, the instructions pertaining 
to sorting and those requiring the checking of 
S’s particular stand on the issue were put on differ- 
ent sheets, and those relating to the attitudinal aspect 
were not passed out until the sortings were com- 
pleted. This temporal sequence was employed in 
line with our basic hypothesis that, whether or not 
the attitude of Ss was explicitly activated, the Ss 
would reveal it in their behavior when faced with 
appropriate stimuli. 

After receiving the envelope containing the 114 
statements, Ss were given the appropriate instruc- 
tions for sorting. The instructions for the actual 
procedure of sorting were identical with those used 
by Thurstone and Chave (19), Hinckley (8), and 
others. The Ss were given 3 to 4 minutes to read 
the instruction sheet. The administrator then read 
the following instructions aloud. 

You are given a number of statements expressing 
opinions in regard to the social position of Negroes. 
These cards are to be sorted into different piles. 

You will find it easier to sort them if you look 
over a number of cards, chosen at random, before you 
begin to sort. 

You are given eleven cards with roman numbers on 
them: I, Il, Ill, IV, V, VI, VII, VIII, IX, X, XI. 
Please arrange these before you in regular order. Under 
Card I, put those statements which are most unfavorable 
in regard to the social position of Negroes. Under Card 
XI, put those statements which are most favorable in 
regard to the social position of Negroes. Under each of 
the other 9 cards, between I and XI, put those statements 
which correspond to that step in the 11 piles.4 

This means that when you are through sorting you 
will have 11 piles of statements arranged in order 
from I, the lowest, to X1, the highest. 

Use your judgment as to where each statement should 
be placed in the 11 piles. Do not be concerned about 
the number of cards in each pile. 

When you are through sorting, please put a rubber 
band around each of the 11 piles of cards, placing the 
numbered card on top of each pile. 

If you complete the sorting before others, please remain 
quietly in your seat until final announcements are made 
by the experimenter. 


After the appropriate instructions for sorting were 
read aloud, the administrator answered any ques- 
tions concerning the sorting to be sure that instruc- 
tions were understood. Administrators were, how- 
ever, cautioned not to use examples relating to 
Negroes to clarify the instructions. For example, 
as an instance of a very unfavorable statement 
on an issue, administrators used “Organized religion 


* Probably for this issue, it would have been preferable 
to use I as the favorable and XI as the unfavorable 
end. The few sortings done by Negro Ss under indi- 
vidual supervision showed a tendency to switch I to 
the favorable end unless reminded that I represented 
the unfavorable end. Therefore, special care was taken 
in the experimental sessions (especially with Negro Ss) 
to repeat several times that I was the unfavorable, XI 
the favorable end of the scale. To provide maximum 
comparability we conformed to Hinckley’s use of I 
as the most unfavorable end. A small number of Ss 
consistently reversed the two extremes and these data 
were recopied in the standard direction. 
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is the greatest single detriment to the advancement 
of civilization in the world today”; or, as an ex 
ample of a very favorable statement on an issue, 
Organized religion is the only hope for the salva 
tion of the world.” 

the instructions for sorting were clear to 
sorting began. During this time, the 
administrator distributed rubber bands to each S§ 
and perform ther tasks so that he would be in 
a position to that each S§ was conscientiously 
k according to instructions, with 
out seeming to be observing the placement of 
The sorting process consumed about 
55 minutes. The Ss were reminded to remain in 
their seats and keep their cards with them until all 


Vhe n 


the Ss, the 


performing the ta: 


specific cards. 


Ss had finished sorting. 

Obtaining the judges’ own attitudes. The second 
instruction sh which concerned the Ss’ agree 
ment and disagreement with the piles of statements 


id sorted, was then distributed. After sorting, 


ect, 


they 


the following instructions were given for the pur 


ose of tapping Ss’ attitudes on the issue 


the sorting of 


« ards 


completed 
that pile of 
the issue (that is, the 
On numbered c 
rit the wo . 

this 

the 
str 


care 
which comes 
view on 
Negroes 


the 


lential 


dural step was to have the 
ents which they 
“unfavorable, 
favorable” to 
would 
propor 
and tl 


pro 
roportion of 


ry untav 
ble,” and “very 
hether 
’ 


between 


or not there 
actual 
1 
ious pies 
port he he I< 
portions When the cards 
is critical for the 
re is a general correspond 

e patterns of these percentages fi 
tual sight of the objective 
actual proportions into 
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explanation for dis 


or more stat 

ippears inadequate 
ictual proportions in 
*s filled in by Ss at a 
1 consistency in evalua 


1 at two time intervals 
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which would not be expected if the sorting were 
casual and random. 

The estimates of sortings were obtained after all 
Ss had replaced the statements in their envelopes. 
The administrator passed out a 3X5 in. card to 
each S individually so that he could to be 
sure that the sorted piles were in the envelopes. The 
administrator explained the task orally as follows: 

Now that 
in terms of 
the 


check 


the statements into piles 
Negroes, 


which in 


you have sorted 
their stand regarding 
percentage of statements 
were “very unfavorable” to Negroes, 
to Negroes, “neutral,” “favorable” to Negroes, 
to Negroes. Write the per cent 
stands printed on the card from 
“very favorable” in the appro- 
column headed “% of state- 


please 
your 
“un- 


estimate 
judgment 
favorable” 
and “very 
for each of 
“very unfavorable” to 
priate under the 
ments regarding Negroes.” ‘The five percentages which 
you write should add up to 100%. You note 
that there is a question at the bottom of the card to be 
back. I believe the other items are 


favorable” 
these 


space 
will 


answered on the 
self-explanatory. 

Supplementary attitude measurement with white 
judges. It was important to have a clearer differen- 
tiation of groups than that afforded by the rather 
crude list of six statements used for this purpose by 
Hinckley. In the case of Negro Ss, the intensity 
of agreement and disagreement with the piles of 
statements they had sorted and the obvious attitude 
of Negro Ss suficed. But for the white Ss, a further 
check on their attitudes was obtained by adminis- 
tration of the Likert Negro Scale ® as the final step 
in the session. The scale was not administered to 
Negro Ss discussion with Negro pretest 
Ss revealed a strong animosity toward filling out 
a form prepared entirely from the white point of 
view. In fact, some Negro Ss spontaneously re- 
marked that the statements to be sorted were pre- 
pared from the white standpoint. 


because 


Subjects 


The final procedure presented above was decided 
upon after pretesting on both white and Negro Ss 
(N=100). Ten Ss made the i 
under close supervision with 
detailed hints for the procedure. 

The main concern in the selection of the final 
experimental Ss was to insure groups of Ss who 
differentiated segments on the 
ely favorable position 
gard to 


sortings individually 
the view of getting 


would represent 
scale ranging from an extren 
to an extremely unfavorable 
the the social position of Negroes. For 
this purpose, groups and institutions were selected 
which would insure a likelihood of obtaining such 
differentiated positions on the scale 

Ne g? It followed that Negro Ss should 
be used, to represent the pro end of the scale. A 
total of 103 were used. Fifty-four were enrolled 
as graduate students or were attending the summer 
session at the Univ of Oklal These more 
Negroes may be taken as most self 
pro group in this study. Since these 
were attending the University in the 


position in r 


issue of 


) groups. 


rsity yma 


mature the 
consciously 


individuals 


See Likert, R. A technique for the measurement of 
attitudes. Arch. Psychol., N. Y., 1932, No. 140. 
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year following the Supreme Court decision ending 
segregation at the University, they had a sense of 
mission and an accentuated identification as mem- 
bers of their group. Many were brought to the 
university by the National Association for the Ad- 
vancement of Colored People, which has encour- 
aged Negroes on the campus to form active social 
groups. As a group, these students had had more 
actual contact with discrimination practices than 
the younger Negro Ss. 

The younger group of Negro Ss was made up 
of 49 undergraduates at a state university for 
Negroes; hence, their age and educational level are 
lower than that of the University of Oklahoma group. 
While segregation and awareness.of anti-Negro 
feeling had been a part of their life, they had grown 
up largely in a Negro environment. Since this uni- 
versity is also located in a Negro community, this 
relatively “sheltered” condition is continued in 
academic life 

White groups: 1. Pro-Negro. Because validity de- 
rived from the stands taken in actual life was the 
desired criterion governing selection of subject 
groups, two small groups of white Ss who had 
actually participated in pro-Negro activities were 
chosen. Members of these groups, totaling 19, had 

en among the leaders in local campaigning at the 
University of Oklahoma (by student polls, news 
paper work, etc.) for an end to segregation. One 
group was selected by the director of the YMCA 
as among those individuals who were thus active. 
Another group had been organized around this 
issue. Thus, for these very pro-Negro white Ss, 
personal information as well as details of the stand 
of their groups were available. 

2. “Average” subjects. Here results from a num- 
ber of different colleges throughout the South were 
These included two in an area in south 
Oklahoma popularly known as “Little 
Dixie.” In general, students attending these schools 
are from a level than those 
attending the state university. The educational level 
of the area is relatively low. Affiliation with funda 
mentalist religious groups is common. The students 
in these colleges are, for the most part, preparing 
to become teachers. From one, 53 Ss were chosen 
from classes in elementary English. From the 
other, 58 students, who had not been exposed to 
courses in either psychology or sociology, were 


secured. 
eastern 


lower socioeconomic 


used. 

Three colleges and universities from the state of 
Georgia were also secured. Sixty-four Ss were ob- 
tained, distributed about equally through the three 
institutions. In all, 194 white judges were used. 

3. Anti-Negro subjects. It was hoped to secure 
a group of strongly anti-Negro Ss. The major 
difficulty in finding strongly anti-Negro Ss lay in 
the fact that no groups organized strictly around 
such a stand were available. Further, pretesting re- 
vealed a strong tendency on the part of college stu 
dents to express more liberal attitudes than would 
probably have been expressed at the time of the 
original Hinckley study. A number of such Ss 
who were believed to be somewhat less liberal on this 
issue were obtained from living units at the Univer- 
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sity of Oklahoma. But it finally appeared necessary 
to use the Ss in the total white sample who ex- 
pressed the most anti-Negro attitudes on the eval- 
uation cards and on the Likert questionnaires. 
REsuLts 
Categorization of Statements by Various 
Groups 

One method of analyzing any differences 
between the major groups of Ss in the degree 
to which items are displaced is to determine 
the number of statements placed in each of 
the eleven categories. Results on these dis- 
tributions are shown in Fig. 1 for (a) Negro 
Ss, (b) strongly pro-Negro white Ss (leaders 
in the anti-segregation movement), and 
(c) “average” white Ss from six Southern 
colleges. To permit comparison of the 
results for the last group with those ob- 
tained by Hinckley, his procedure of elimi- 
nating all individuals placing 30 or more 
statements into any single category was also 
studied. Data are also presented for a group 
of 17 Ss who appeared to have the most anti- 
Negro attitudes within the white group. 
These 17 Ss had indicated disagreement with 
the pile of statements most favorable to 
Negroes and had markedly anti-Negro Likert 
scale scores (below 50). 

It will be observed that the Negro group 
shows a heavy piling up of statements in the 
extreme categories, indicating displacement 
of neutral statements to the extreme position. 
This displacement is particularly noticeable 
for category I at the anti-Negro end, a posi- 
tion with which these Ss strongly disagree. 
The distribution of responses of the pro- 
Negro white group is similar to that for 
Negroes but a little less extreme. The dotted 
line at 30 indicates the frequency which 
Hinckley used to eliminate Ss. Over three- 
fourths of the Negro Ss and two-thirds of 
the pro-Negro white Ss would be eliminated 
by applying the Hinckley criterion. 

The data for the average white group show 
considerably less piling up at the ends. When 
the Hinckley elimination procedure is fol- 
lowed the distribution is relatively flat. The 
differences in the distributions for the Negro 
and average white are highly significant 
statistically. 

The results for the anti-Negro Ss are not 
strikingly different from those for the average 


white Ss. But it will be noted that as com- 
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pared with the white Ss they tend to show a 
slightly greater tendency to concentrate items 
at the positive end of the scale. 

Scale for 


Subgroups 


Values the Various Criterion 

Another method of studying displacement 
or items 1S to investigate the distributions of 
obtained for the various sub- 
groups. These are the data on which Thur- 
stone, Hinckley, and others base their main 


case for the validity of their scaling proce- 
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dure. Inspection of the original 114 items 
reveals that all of the statements are not 
equally appropriate for white and Negro Ss. 
For example, items like “I would not patron- 
ize a hotel that accommodates Negroes” or 
“My lack of contact with the Negro makes 
it impossible for me .» pass judgment as to 
his social position” are not as relevant to 
Negro Ss as to whites. But it was possible 
to find 11 items which seemed equally appro- 
priate for the Negro and white populations, 
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and which were approximately equidistant in 
the original Hinckley scaling. Scale values 
were then found for each of these items for 
each of the present groups, using the conven- 
tional Thurstone procedure. The eleven 
items were all used in the final forms of the 
Scale of Attitude toward the Negro.® They 
are numbered 5, 6, 10, and 16 in Scale A, and 
I, 2, 3, 5, 8, 9, and 15 in Scale B. 

The distributions of scale values for these 
eleven items derived from the responses of 
the various groups are presented graphically 
in Fig. 2. For comparison, the scale values 
reported by Hinckley are given. It will 
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NEGRO SUBJECTS 
otns 
ANTI- NEGRO 
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ScaLeE VALUES OF 11 SELECTED ITEMS FOR 
EacH oF THE Four Major Groups 


Fic. 2. 


Values obtained for these items by Hinckley (8) 
are included for comparison. Number of Ss in each 
group is the same as in Fig. 1. 


be seen that there is considerable simi- 
larity in the scale values for the average 
white Ss (with the Hinckley procedure of 
eliminating those with 30 or more statements 
in any category) and the scale values orig- 
inally found by Hinckley. The correlation 
between the two sets of scale values is .96 (11 
items). But the values for the strongly pro- 
Negro white judges and for the Negro judges 
do not correspond at all to those obtained by 
Hinckley. The distributions of the Negro 
groups show the majority of the items 
bunched up at the extreme anti-Negro end 
of the scale (in line with the results of Fig. 1). 
The items which are neutral for Hinckley’s 
group and for our “average” white Ss are 
displaced by the Negro judges to the ex- 
tremes, principally to the anti-Negro end. 
The distribution of scale values for the pro- 
Negro white Ss is similar to that for the 

6 Hincxiey, E. D. Aéttiude toward the Negro, Scale 


No. 3, Forms A and B. (L. L. Thurstone, Ed.) Chicago: 
Univer. of Chicago Press, 1930. 
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Negro Ss but again not quite as extreme. 
The results for the anti-Negro Ss also show 
a tendency for neutral items to’ be displaced 
toward the extremes, but in their case some 
displacement is also noticeable toward the 
pro-Negro end. 


“Carelessness’ as an Explanation for the 


Results 

It will be recalled that Ss who placed 30 
or more statements in any single pile were 
discarded by Hinckley on the grounds of 
carelessness. To obtain a check on this ex- 
planation for the large number of our Ss who 
placed this number of statements in the ex- 
treme position piles, after the sorting was 
completed and the cards placed in their 
envelope, a question was asked concerning 
the Ss’ impressions of the percentage of the 
statements which were “very unfavorable,” 
“unfavorable,” “neutral,” “fairly favorable,” 
and “very favorable” to the Negro. From 
these data we have evidence that the Ss: who 
placed a large number of their cards in the 
extremely unfavorable categories may really 
have perceived the statements in that way and 
did not place them there through carelessness. 
The data are presented in Table 1 to show 
the difference in the evaluations of the num- 
ber of unfavorable statements in the list 
among Ss who placed less than 30 statements 
at the anti-Negro end of the scale, among 
those who placed between 30 and 59, and 
among those who placed 60 or more in this 
category. The Ss who placed a greater number 
of statements in the anti-Negro category were 
definitely more inclined to believe that there 
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were a high number of unfavorable state- 
ments in the list. The chi-square value for 
the distributions is 43.3, which gives a proba- 
bility value of considerably less than .oo1. A 
very high degree of relationship is not to be 
expected in view of the fact that Ss had not 
been given any prior intimation that -they 
would be required to make such a judgment. 
For the entire group of 258 Ss on whom the 
relevant information was available, a corre- 
lation of .32 was obtained betweer the per- 
centage of statements considered “very un- 
favorable” and “unfavorable” and the num- 
ber of cards placed in the extreme anti-Negro 
category. ‘This correlation is significantly 
different from zero at considerably less than 
the .oor level. These data suggest that Ss 
who put a large number of statements into 
do so not because of 


the extreme category 


carelessness but because they actually perceive 


the statements differently. 


Discussion 


The results just presented clearly support 
the three principal hypotheses investigated. 
Even though judges are not asked to state 
their own attitudes, the position of the judge 
luence in a significant 


on the issue does inf 


way his categorization of the items. The 
displacements are away from the individual's 
position, notion that 


supporting the 
those with extreme positions are more selec- 


own 
tive in the statements they will accept at 
their own end of the scale (their “threshold 
of acceptance” is raised), and more inclined 
to reject items which are at variance with 
their position (their “threshold of rejection” 
s lowered). The degree of displacement is 
greatest for the items in the middle of the 
scale, in line with the hypothesis that “neu- 
tral” items are more ambiguous’ and less 
well structured, and hence are more readily 
to the personal interpretations of the 


subject 
ives. 
The present data do not indicate whether 
stortions and displacements are caused 
that with extreme 


yu 
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the di 
by tl 


1e fact individuals 
“lower Q 
higher Q 
163). On 


has called attention to 
the scale and 
he scale” (p. 
lwards concluded “Clearly, 
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“neutral” 


indicate 
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ours.) 
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positions lack the ability to discriminate be- 
tween adjacent items at the opposite extreme, 
and hence place them in the same category, 
or whether they reflect variations in the 
judge’s interpretation of the total scale and 
the magnitude of category intervals. On the 
surface, the second alternative is supported 
by the study of Ferguson (6) who found no 
differences in the distributions of paired com- 
parisons for judges with differing attitudes 
toward war. But in the light of the present 
results a question is again raised as to the 
magnitude of differences in attitudes of his 
contrasted groups of judges and in the degree 
of their involvement *in the issue. During 
the thirties, when the study was done, it is 
likely that variation in attitude toward war 
may not have been very pronounced, and 
Ferguson himself states that it was impossible 
“to secure any groups favorable toward war” 
(p. 116). The data of the present experiment 
suggest that it is quite likely that the effects 
are specific to methods of absolute scaling 
(the alternative), since the rank 
orders of the items for the pro and con judges 
are very similar. An experiment is currently 
underway specifically on this problem, in 
which the Attitude toward the Negro issue 
is again used with Negro Ss in an attempt to 
see whether the items which are bunched 
together with the equal-appearing interval 
method are clearly differentiated with the 
method of paired comparison. 

On the basis of the present study it appears 
quite likely that the startling difference be- 
tween the present results and those of 
Hinckley is due to his elimination of the 
extreme cases. When his elimination proce- 
dure is followed and the sample restricted to 
white Ss, there is a close correspondence in 
results even though 20 years have elapsed 
since his original study. 

Evidence against the explanation of the 


sece nd 


results as due to “carelessness” comes from 
several sources. The data already presented 
indicate that those who placed a large num- 
ber of the statements in the extreme anti 
categories judged at a later time that more of 
the items had been unfavorable to Negroes. 
Further evidence comes from another study 
done in conjunction with the present one in 
which Ss were allowed to use as many or as 
few categories for sorting as they deemed 
necessary to distinguish among the items on 
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the issue. The Ss who placed a large num- 
ber of items in the extreme categories under 
the present instructions used fewer categories 
to express shades of opinion and again placed 
a large number in the extreme categories. A 
different type of evidence is the lawful nature 
of the phenomenon. It is the very subgroups 
with extreme positions (determined either 
by their own attitude statements or by their 
assignment to a criterion group) who place 
the large number at the extremes, and the 
items they displace are those in the middle 
of the scale (the more ambiguous items). 

Our results do, as Thurstone (19) had 
clearly anticipated, raise serious questions as 
to the use of the method of equal-appearing 
intervals on issues where strong attitudes are 
involved. Further research is needed to 
determine whether under these conditions a 
single scale is involved for the different 
groups of judges or whether distinctive scales 
will be required. A Guttman or Lazarsfeld 
type of analysis will be required to answer 
this question (18). Even if a single scale is 
involved, great care will be required in the 
selection of judges, since their position may 
influence the scale values obtained. Obtain- 
ing an unbiased sampling of positions may 
constitute a difficult practical problem, since 
on many issues attitudes are related to edu- 
cational and socioeconomic status. It may 
therefore be very difficult to secure repre- 
sentatives of all positions among those who 
are well enough educated and have sufficient 
sophistication to carry out the difficult task 
of categorizing the items. 

From the present results there emerges an 
interesting possibility for developing a be- 
havioral, “projective” method of attitude 
assessment through study of the way an indi- 
vidual sorts statements on an issue. If the 
tendencies found in the present experiment 
for individuals with extreme positions to 
bunch up the statements at the extremes are 
found for other issues, it may be possible to 
assess the attitude of an individual without 
ever asking him his opinion but by relying 
entirely on the way he distributes his judg- 
ments. Individuals with more or less neutral 
attitudes would be expected to space their 
judgments rather evenly over the entire 
range, those at the pro end would tend to 
reject neutral items and hence pile them up 
at the anti end, and those with anti attitudes 


g 
31 


would place them at the opposite end of the 
scale. As indicated previously, one may even 
carry this technique a step further by not 
using a fixed number of categories, but by 
deliberately letting the individual establish a 
scale of his own with whatever number of 
categories he sees fit. Then the extension or 
constriction of the scale he establishes, the 
number of categories he uses, the crowding 
or neglecting of certain categories, and the 
direction of concentration of items may all 
provide useful indices for analysis of the way 
the individual “perceives” the issue. In such 
a technique one would obviously want to 
insure the inclusion of a large number of 
ambiguous and unstructured items since these 
will be the ones which are more conducive 
to displacement. In another study done in 
conjunction with the present experiment, 
investigation has been carried out along these 
lines and the results will be reported in a 
forthcoming paper. 
SUMMARY 

1. Investigation was made of the apparent 
discrepancy between studies in the field of 
judgment, where the position of the individ- 
ual on an issue affects the nature of his judg- 
ments, and those in the field of attitude 
scaling, where it has been reported that the 
scaling of statements is independent of the 
judge’s stand on the issue. 

One possible hypothesis for the discrepancy, 
based on an analysis of the original studies, 
was that prior experiments had not employed 
Ss with a sufficiently wide range of attitudes 
to represent adequately the strongly involved 
Ss who would be most likely to show the 
distortions and displacements typically ob- 
tained in the field of perception and judg- 
ment. Accordingly, every effort was made 
to secure samples of Ss who were deeply 
involved in the issue about which the items 
to be judged were written. Attitudes toward 
the social position of the Negro furnished an 
issue on which strong personal involvement 
could be secured, particularly on the part of 
Negro Ss. The materials to be judged were 
the 114 statements originally used by Hinck- 
ley (8), and his instructions and procedures 
were duplicated. In addition to Negro 
judges, groups of white Ss with known pro- 
Negro attitudes, “average” white Ss, and a 
small group of anti-Negro whites, were em- 
ployed as judges. 
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It was found that Negro Ss and mili- 
tantly pro-Negro white Ss tended to place a 
disproportionate number of statements in the 
extreme categories, in line with the predic- 
tions from judgment theory that individuals 
attitudes tend to see issues in 
“black and displace neutral state- 
ments to the extremes. The numbers of 
statements assigned to the various categories 
much more 


with strong 


and white” 


by “average” white Ss are 
uniform. 

3. Scale values for 11 statements spaced 
rather evenly over the entire scale in Hinck- 
ley’s study were compared for each of the 
major groups. The distributions of scale 
values for the Negro and pro-Negro white Ss 
were ‘compressed at the anti Negro end ¢ 
a small number at the ie 
extreme. Those for the “average” white Ss 
were spaced more evenly throughout the 
range in a manner closely approximating the 
distribution found by Hinckley. Anti-Negro 
Ss tended to displace neutral statements to the 
extremes, particularly toward the pro-Negro 
end. 

4. The results support the notion that part 
of the discrepancy in results between the 
present study and those of previous experi- 
menters may be attributable to the representa- 
tion of a greater range of attitudes. In the 
experiment of Pintner and Forlano (14) on 
attitude toward patriotism it is likely that 
there was considerable homogeneity in posi- 
verhaps insufficient personal involve- 
Hinck 
ating Ss who placec 1 2 
egory may have eliminated 
the individuals with the strongest attitudes 
on the issue. His procedure would eliminate 
the majority of our Negro Ss and many of 
our pro-Negro white Ss. 

The disproportionately large number of 
extreme does not 
appear to be due to carelessness, as thought 
by Hinckley. Evidence against this explana- 
tion is the fact that Ss who put a large num- 
ber of statements in the extreme anti-Negro 
categories expressed the opinion on a ques- 
tionnaire after the cards were removed that a 
high proportion of the statements were un- 
favorable to Negroes. Further evidence is 
the lawful regularity of the phenomenon 
obtained. 


the scale, with 


tion and | 
: ley’s study the procedure of 
30 or more state- 
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eimuine 
ments in any cate 


sortings in the position 


6. The implications of the present results 
for procedures used in the construction of 


attitude scales are discussed. 
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PERCEPTUAL DEFENSE IN NORMAL AND IN SCHIZOPHRENIC 
OBSERVERS 
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HERE seems to be little question that the 

perceived world of the psychotic differs 

in many respects from that of the nor- 
mal individual. The nature of such a differ- 
ence, however, remains largely a matter of 
conjecture. This is not surprising when we 
consider that our knowledge of the perceptual 
reactions of the average person has not yet 
reached the stage of systematic formulation 
and reliable predictability. Despite the tenta- 
tive nature of our theories concerning the 
affective and conative determinants of nor- 
mal perceptual response, we may nevertheless 
profit from such investigations into the per- 
ceptual behavior of psychotic individuals as 
seem warranted by available knowledge and 
techniques. Indeed, the parallel study of 
perceptual processes in normal and psychotic 
individuals may advance our knowledge of 
both. It is the purpose of the present investi- 
gation to determine whether one such process, 
perceptual defense, operates in schizophrenics 
in a manner similar to that manifested by 
normal observers. 

The term perceptual defense has come into 
use as interpretative of the elevated visual and 
auditory recognition thresholds of observers 
for unpleasantly-toned stimulus words (3, 6, 
g, 10, 13). An attempt to place this clinically 
oriented concept within the general frame- 
work of reinforcement theory has been 
attempted by several writers (8, 9). Other 
investigators (5) have suggested that the 
higher duration thresholds of taboo words 
can be explained in terms of their relatively 
low frequency of occurrence without refer- 
ence to defense mechanisms. 

In a previous experiment, one of us dis- 
covered that male and female observers 
selected from a college population exhibited 
significantly higher thresholds of visual recog- 
nition for emotionally-toned words than they 
did for neutral words of the same length (6). 
Howes and Solomon (5) have questioned 


1 Now at the University of Maryland. 


the interpretation of these findings in terms 
of avoidance behavior by arguing that the 
taboo words employed in the study had lower 
frequency listings in the Thorndike-Lorge 
tables (12) than did the neutral words, and 
hence might be expected to require longer 
exposure for accurate recognition. They fur- 
ther point out that the presence of both male 
and female experimenters, as well as the for- 
mality of the experimental setting, might 
have inhibited prompt veridical report by the 
observers. 

For reasons elaborated elsewhere (7), these 
objections do not necessarily invalidate the 
proposition that observers have greater diffi- 
culty recognizing taboo words than they do 
neutral words. We planned the present ex- 
periment both to check this assumption and 
to compare the recognition behavior of nor- 
mal and schizophrenic observers presented 
with neutral and taboo words under condi- 
tions of tachistoscopic exposure. 


METHop 

Subjects in the experiment were 20 firemen 
employed in Tuscaloosa, Alabama, and 17 institu- 
tionalized males, diagnosed as schizophrenic, from 
Bryce State Hospital in Tuscaloosa. Observations 
were made in the local fire station at which the 
firemen were employed and in a room made avail- 
able to us at Bryce State Hospital.2 By using only 
male observers (Os) with a male experimenter (E), 
and by conducting the experiment in a familiar and 
informal setting for both groups of Os, we hoped 
partially to eliminate embarrassment or diffidence 
as factors contaminating the fidelity of verbal report. 

To control in so far as possible the effect of 
word frequency on recognition, we selected neutral 
words from the Thorndike-Lorge tables which had 
approximately the same frequency of occurrence as 
the taboo words employed. ,As an index of fre- 
quency, we took the average of the Lorge magazine 
and Lorge-Thorndike semantic counts. The words 
used in the experiment are listed in Table 1. Each 
word was typed evenly in capital letters on a roll of 
cash-register paper which was mounted in a Har- 
vard tachistoscope so that the words could be ad- 
vanced seriatim into the viewing field. The pre- 

2 The authors are grateful to Dr. Toombs Lawrence 


who very kindly extended the use of Bryce Hospital 
facilities for a portion of this investigation. 
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+ 
exposure and exposure fields were equated for 
brightness, the value of each being approximately 
.40 foot-candles as measured by a Weston Master 
Il exposure meter. The Os’ level of brightness adap- 
tation thus remained relatively constant throughout 
the experimental session. Thresholds of recogni 
tion were determined. by presenting each word 
initially for .or second and then at durations in- 
creased successively by .or-second intervals until it 
was reported correctly. Threshold determinations 
for three trial words were included to accystom 
the Os to the apparatus and to allow a levelihg of 
practice effect. The order of the neutral and {taboo 


TABLE 1 
Stimutus Worps anp TuHeir THORNDIKE-LoRGE 


Frequencies LisTep IN THE ORDER 
OF PRESENTATION 


Worp * FREQt 


cater 
raped 
fatty 
belly 


needy 


“We » 
Awoouw 


anvil 
whore 
zebra 
hotex ** 


Aw) 


a 


mumps 
stoke 
penis ** 


tripe 


bison 


I. 
4. 
4. 
8 
4 
I. 


* The taboo words are in italics. 

** These words were not listed in the tables and 
were arbitrarily assigned the same frequency as the 
least common of the taboo words. 
words followed the random order previously used 
by McGinnies (6), and the same taboo words were 
employed in order to make results of the two experi- 
ments comparable. Assuming that increments in 
perceptual acuity due to practice are approximately 
linear following the three trial measurements, we 
may note that the order of presentation distributes 
the benefits of practice almost equally between the 
neutral and the taboo words. 

It was impossible to secure a homogeneous group 
of patients. We wished to eliminate those who had 
undergone electroconvulsive therapy, but since this 
treatment is inaugurated almost immediately in the 
case of most admissions, our sample included ten 
patients with a history of electroshock. None, how- 
ever, were undergoing this form of treatment at 
the time of the experiment. Of the 17 patients 
selected as Ss, 11 were diagnosed as dementia 
praecox paranoid, six as dementia praecox catatonic. 

The two groups, firemen and schizophrenics, 
were fairly well matched with respect to age and 
education. Average age of the firemen was 31 
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years, of the patients 32 years. The firemen had 
completed an average of 10.5 grades, while the 
schizophrenics had a mean educational attainment 
of 9.3 grades. 


REsuLTs AND Discussion 
Recognition Thresholds 


The firemen, who constituted a control 
group in the present study, displayed the pre- 
dicted tendency toward higher recognition 
thresholds for the taboo words as compared 
with the neutral words. Although not as 
pronounced as the difference reported by 
McGinnies in the case of college students, the 
trend in the present study yielded a ¢ of 2.16 
and justifies rejection of the null hypothesis 
at better than the .05 confidence level. A 
similar finding was obtained for the patients, 
the difference here giving a ¢ of 3.19, signifi- 
cant beyond the .o1 confidence level. Results 
for all Os are presented in Table 2. In view 


TABLE 2 


THRESHOLDS OF RECOGNITION IN MILLISECONDS OF 
ALL OBsERVERS FOR THE NEUTRAL 
AND Tasoo Worps 


(Tests of significance of differences are included.) 


Group NEUTRAL Tazso0o 
Controls 67.6 75.8 
Patients 99.1 116.6 





of the fact that Thorndike-Lorge frequencies 
of occurrence were balanced for all words, the 
present findings are embarrassing to the posi- 
tion taken by Howes and Solomon (5), who 
maintain that the concept of perceptual avoid- 
ance can be replaced by a response probability 
factor. 

Further inspection of the data reveals that 
the recognition thresholds of patients for both 
the neutral and the taboo words were higher 
than those of the normal Ss. The mean dif- 
ferences here were significant beyond the .or 
level of confidence. These findings are con- 
sistent with observations by Shakow (11) that 
schizophrenic Ss in general have longer audi- 
tory and visual discriminatory reaction times 
than do nonpsychotic individuals. Other 
investigators (2, 4) have reported that schizo- 
phrenics are sluggish in their cardiovascular, 
respiratory, and metabolic reactions to a num- 
ber of stimulating agents. It would seem that 
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certain cognitive as well as autonomic reac- 
tions are retarded in schizophrenia. 

So far as differential thresholds for taboo 
and neutral words are concerned, our psy- 
chotic Os did not differ significantly from 
the normal controls, although their defensive 
behavior was somewhat more pronounced. 
It may be noted, however, that of the four 
reversals of the general defensive trend dis- 
played in the psychotic group, three occurred 
among those six patients diagnosed as “de- 
mentia praecox catatonic.” These patients, 
in other words, showed slightly lower dura- 
tion thresholds for the taboo than for the 
neutral words. As a group, the catatonic 
schizophrenics had lower mean recognition 
thresholds for all words than did the para- 
noids. Breaking down the data for the 
patients in Table 2, we have the paranoid 
group requiring an average of 128.7 ms. 
exposure for recognizing the taboo words, 
103.3 ms. for the neutral words. The cata- 
tonics had a mean threshold of 95.2 ms. for 
the taboo words and 91.5 ms. for the neutral 
words. This observation suggests a further 
experiment in which the recognition behavior 
of individuals diagnosed as catatonic schizo- 


phrenic might be compared with that of 
patients diagnosed as paranoid schizophrenic. 
One might hypothesize that the magnitude of 
perceptual defense to taboo words as well as 
absolute level of recognition for both neutral 
and taboo words would be greater in the case 
of schizophrenics with paranoid tendencies. 


Analysis of Hypotheses 

In order to detect any qualitative differ- 
ences in the development of accurate recog- 
nition by the normal and schizophrenic Ss, 
we recorded all prerecognition ‘hypotheses 
and subsequently grouped them into four 
categories. These categories included (a) hy- 
potheses similar in structure to the stimulus 
word, i.e., words having two or more letters 
correctly placed; (4) hypotheses unlike in 
structure to the stimulus word; (c) nonsense 
hypotheses, i.e., pronounceable groups of let- 
ters having no dictionary meaning; and (d) 
perseverative hypotheses, i.e., those repeated 
during successive exposures of a single word. 
The more striking trends may be stated 
briefly. 

a. Both the normal and the schizophrenic 


835 


Os formed proportionately more hypotheses 
similar to the neutral stimulus words than 
they did to the taboo words. Both groups 
gave more unlike and perseverative hypothe- 
ses when guessing at the taboo words. The 
normal Os reported more nonsense hypothe- 
ses to the taboo words than to the neutral 
words, while the schizophrenics reported 
about the same percentage of nonsense hy- 
potheses in both cases. 

&. For all stimulus words, the normal Os 
formulated proportionately more similar ~hy- 
potheses than did the patients. The schizo- 
phrenics, on the other hand, gave more 
unlike, nonsense, and perseverative responses 
to all words than did the control Os. The 
total number and percentages of responses 
for all Os falling into the various content 
categories are given in Table 3. 

Since the distributions of the above data are 
such as to render them unsusceptible to 
analysis by the usual statistical techniques, 
they are presented as merely suggestive of 
differences in the development of recognition 
in the: two groups of Ss. In general, they 
follow the trends reported in previous studies 
for normal Os (6, 9, 10). 

The tendency on the part of the patients to 
form a preponderance of unlike, nonsense, 
and perseverative responses to both neutral 
and taboo words is consistent with our 
knowledge of the idiosyncratic character of 
psychotic behavior in general. Although few 
studies have been published dealing with the 
recognition aspect of perception in psychotics, 
Angyal (1) has reported that schizophrenics 
are more prone to omissions and substitutions 
than are normal observers in reporting 
tachistoscopically exposed letters. Appar- 
ently schizophrenics are guided less by the 
objective features of the stimulus configura- 
tion than are normals; hence they fail to 
utilize the available cues effectively. Their 
tendencies to perceive nonsense words, to see 
words which bear no structural resemblance 
to the stimulus word, and to perseverate in 
wrong guesses all testify to the heightened 
autistic nature of their perceptual behavior. 


Rate of Hypothesis Formation 


One additional finding is of interest, 
namely, the tendency of the schizophrenic 
Os to generate significantly more hypotheses 





ed 
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to both the neutral and the taboo words than 
did the control Os. This comparison is pre- 
sented in Table 4. Not only did the patients 
require more exposures in order to recognize 
the stimulus words than did the firemen, they 
also engaged in proportionately more pre- 
recognition guessing than did the latter. One 
might describe this tendency on the part of 
the patients as a relatively greater effort after 
meaning or, alternatively, as evidence for lack 
of “control” in this type of problem situation. 
A striking difference appeared between the 


olds for the two sets of words, they were 
closer to the normal reaction pattern than 
were the paranoid patients. For the neutral 
words, the percentage of hypotheses per ex- 
posure for the paranoid schizophrenics was 
32.1, for the catatonic schizophrenics 22.2. 
In the case of the taboo words, the percentage 
for the paranoid patients was 25.0, for the 
catatonics. 14.7. It is interesting in these 
several instances to note that consistent 
gradations of differences exist between the 
paranoids, the catatonics, and the normals. 


TABLE 3 


Tora, Numser oF HypoTHeses FOR ALL OsserverRS FALLING INTO THE SEVERAL CoNTENT CATEGORIES 


(Percentages are in parentheses.) 


Neutra Worps 


GROUF 


Controls 


Patients 


Tasoo Worps 


N 


2 


3 4 
(65.6) (6.9) (24.4) (3.1) 


164 62 106 75 
(40.3) (15.2) (26.1) (18.4) 





extent of this tendency by the patients in the 
recognition of the neutral and the taboo 
words. Their rate of hypothesis formation 
was significantly less for the taboo than for 
the neutral words, as indicated in Table 4. 
A small but statistically insignificant differ- 
ence in the same direction was noted for the 
normal Os. This relative impoverishment of 
hypothesis formation in the case of the taboo 
words may reasonably be interpreted as fur- 
ther evidence for avoidance of recognition of 
these words. With respect both to thresholds 
of recognition and differential rate of hy- 
pothesis formation, the schizophrenics as a 
group displayed greater resistance to recog- 
nition of the taboo words than did the 
normals. 

Again, we may point out a difference be- 
tween the reactions of those schizophrenics 
classified as paranoid and those designated as 
catatonic, even though the subgroups are too 
small to permit reliable statistical evaluation 
of such a difference. The catatonics volun- 
teered fewer hypotheses relative to the total 
number of exposures of both neutral and 
taboo words than did the paranoids. In this 
respect, as in the case of recognition thresh- 


These differences are of degree rather than 
of kind, the catatonics falling closer to the 
normal range in all instances. In view of the 
fact that the prognosis for recovery is gen- 
erally better for catatonic than for paranoid 
schizophrenics, it is tempting to speculate 
that we have discovered a technique for esti- 
mating in a quantitative sense the extent of 
deviation of the recognition behavior of 
patients from the normal pattern. Certainly 
further research on perceptual differences be- 
tween the several diagnostic groups compris- 
ing the schizophrenic syndrome is sug- 
gested by the present findings. 


TABLE 4 


Tora Numer AND PercenTaGE (IN PARENTHESES) 
or Exposures oN Wuicn Hrypotiueses WERE 
VOLUNTEERED BY ALL OBSERVERS TO THE 
NeuTRAL AND Tasoo Worps 


(Tests of significance are indicated.) 








Group NEUTRAL TaBoo t Pp 





137 (10.1) 131 (8.6) -30-.40 
479 (28.4) 407(20.5) 3. <.01 


Controls 
Patients 
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SUMMARY 


The recognition thresholds of twenty nor- 
mal adult males employed as firemen and 
seventeen schizophrenic male patients in a 
state mental hospital were determined for ten 
neutral and seven emotionally-toned words 
balanced for length and frequency. Obser- 
vations were made under informal conditions 
so as to decrease the possibility of reluctance 
by the Os to report the taboo words. Both 
the normal and the schizophrenic Os showed 
significantly higher thresholds of recognition 
for the taboo as compared with the neutral 
words. The general level of threshold for 
both types of words, however, was signifi- 
cantly higher for the patients than for the 
normal controls. 

Evidence for the more compelling role of 
autistic factors in the recognition behavior of 
the patients was observed in their tendency 
to formulate proportionately more nonsense 
hypotheses, hypotheses structurally unlike the 
stimulus word, and to perseverate in wrong 
guesses. All of the Os showed a greater 
tendency to be guided in their prerecognition 
guessing by the structural characteristics of 
the neutral than of the taboo words. 


The schizophrenic Os generated more hy- 
potheses in proportion to total number of 
exposures of both neutral and taboo words 


than did the controls. They volunteered a 
significantly higher proportion of hypotheses 
to the neutral words, however, than to the 
taboo words. Those patients diagnosed as 
catatonic schizophrenic were closer to the 
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normal pattern with respect to thresholds of 
recognition and rate of hypothesis formation 
than were those classified as paranoid 
schizophrenic. 
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EADERSHIP studies have focused in general 
on two major problems, namely the 
distinguishing characteristics of leaders 

and followers and the relationship of differ- 
ential leadership to group functioning. Stog- 
dill (6) has reviewed the major trait studies. 
He reports that 15 or more studies indicate 
that leaders excel nonleaders in intelligence, 
scholarship, dependability and responsibility, 
activity and social participation, and socio- 
economic status. In addition, he reports that 
15 or more studies indicate that the qualities, 
characteristics, and skills required for leader- 
ship roles are largely a resultant of the de- 
mands of the situation. 

Although Stogdill concludes that “leader- 
ship in one situation does not necessarily 
mean leadership in another,” he also points 
out that studies providing the strongest argu- 
ments for the situational nature of leadership 
also provide the strongest evidence for per- 
sistent and relatively stable leadership and 
nonleadership patterns. 

The apparent inconclusiveness of the re- 
searches reviewed by Stogdill might be 
explained on several grounds. (a) The defi- 
nition of leaders in these studies was highly 
variant and in consequence many of the 
investigators were dealing with different phe- 
nomena. (4) The populations studied were 
highly heterogeneous. Some studies dealt 
with nursery school children, others with 
adolescents and some with adults. (c) The 
situations and the groups in which the leaders 
functioned were highly variant. This sug- 

1 The studies to be reported in this and other articles 
under the general title of Studies of Group Behavior 
have been carried on by the Laboratory for Research ‘n 
Social Relations at the University of Minnesota under 
grants from the Graduate School; the College of Science, 
Literature, and the Arts; and from the Carnegie Corpora- 
tion. 

2Members of the Minnesota Laboratory at the time 
of the research. 
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gests that leadership be viewed as an aspect 
of group structure. Thus, the leadership role 
is seen as an aspect of the organization of the 
group and of a specific situation. (d) The 
inconclusiveness of leadership studies also 
might be accounted for on the grounds that 
the leadership phenomenon can be explained 
only by the consideration of both personality 
and situational factors. Such an approach is 
suggested by Hemphill (2). 

One way to clarify the disparity between 
the situational and trait explanations of 
leadership would be to study first the differ- 
ential characteristics of different kinds of 
leaders, and second the distinguishing char- 
acteristics of leaders and other group mem- 
bers in groups with highly similar social 
organizations and operating under highly 
similar conditions. Thus, if a series of groups 
could be studied in which the roles, the 
group purposes, the formal status systems, 
and the shared norms were highly similar, 
then the following problem could be investi- 
gated: Do different kinds of leaders and do 
leaders and other group members differ on 
certain psychological and sociological criteria 
when the group situation is relatively similar? 
The opportunity to investigate this problem 
was afforded in the Village Project of the 
Laboratory for Research in Social Relations, 
since thirteen groups of girls living in a 
university cooperative village interacted in 
situations that in large measure met the above 
specified criteria (1). 


METHOD 


Definition of Leaders, Followers, and Isolates 


Individual girls were seen as playing leadership 
roles in their respective groups if they held the 
office of house president at any time during the 
academic year (formal leadership), or if they re- 
ceived at least one-third or more of the votes of 
their house associates on the second part of the 
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following question, contained in Sociometric In- 
ventory II (administered in May): Name the girls 
who were the outstanding leaders in your house 
during the past school year. Check the ones who 
were most outstanding. Thirty-seven girls were so 
identified. 

Described as isolates were those particular girls 
who appeared in the lowest quartile of at least two 
of the distributions of number of choices received 
on the foilowing sociometric questions: 

Sociometric Inventory I, Question 5. Name the 
girls any one of whom you would like to have as 
a roommate. 

Sociometric Inventory II, Question 6a. Name the 
girls in the village whom you consider as friends. 
b. Check the names of those whom you consider as 
most intimate friends. Section III. List the names 
of those girls in the village whom you can honestly 
say that you like. 

Arbitrarily eliminated from this group of isolates 
were three persons who had already been identified 
as formal leaders and also two girls whose period 
of residence was such as to preclude their being 
mentioned on sufficient criterion questions; a group 
of 23 girls remained. Those individuals identified as 
neither leaders nor isolates, 81 in number, were 
assumed to be followers. 


Comparative Data 


The data for some 100 variables were available 
from the administration of several measuring in- 
struments; their nature and source can be indicated 
briefly by the examples listed below: 

. Age 

. Scores on Minnesota Personality Scale 
. Paternal occupation 

. Peer ratings of contribution to essay contest 
(Sociometric Inventory II, Section Il): “You have 
recently participated in an essay contest. We are 
interested now in obtaining your estimate of the 
contribution of each girl in your house to this 
project. Opposite each name listed below put a 
check mark in the appropriate column: Contributed 
a Great Deal; Contributed Some; Contributed Little 
or None.” 

5. Satisfaction with Village life. “What are your 
attitudes toward your present residence in the Co- 
operative Village? Be as frank as possible. (2) Do 
you feel proud of, indifferent to, or defensive about 
living here? Elaborate. (4) Would you rather be 
living in a dormitory or sorority? (Yes, No) Ina 
private home? (Yes, No) If it were possible, in 
your own home?” (Yes, No) 

6. Self-estimates: physical, motor, emotional, so- 
cial, mental, and personality development and ad- 
justment. 

Emotional Development and Adjustment. “On 
each of the following (20 items), rate yourself with 
respect to your associates (on a five-point scale): 
tendency to get angry, although controlled; tend- 
ency to ask others for advice; tendency to worry; 
ability to see self objectively; ability to accept 
your own blunders without blaming others.” 

7. Participation in family activities: “Referring 


to the two general periods of your life mentioned 
below (as a child, in high school), describe the 
activities in which all or most of the members of 
your family participated as a group. Encircle the 
appropriate letter: L, a lot; S, some; N, never at 
all (26 items); discussing topics of general interest, 
not at table; singing or playing music together; pic- 
nics; card (bridge, rummy, etc.) or board (check- 
ers, monopoly, etc.) games; visiting at friends’ 
homes, as a family group.” 


REsuULTs 


The analysis of variance technique was 
used in those cases where the assumption of 
homogeneity of variance could be satisfied 
to establish whether or not the differences 
among the three groups of leaders, followers, 
and isolates for any one variable were sta- 
tistically significant. Only those comparisons 
which yielded positive results in the sense 
that the pertinent null hypotheses were dis- 
proved in part or in full will be reported. 


Differences Among Types of Leaders 


Two criteria were used to identify leaders, 
it may be recalled: election to office (formal 
leadership) and being perceived as a leader 
by at least one-third of their associates (in- 
formal leadership). On the basis of these two 
independent criteria, it should be possible to 
identify three mutually exclusive categories 
of leaders: those individuals who are formal 
but not informal leaders; those who are both 
formal and informal leaders; those individ- 
uals who are informal leaders but not formal 
leaders. An analysis of the 37 leaders showed 
that there were 24 girls in the first category, 
6 girls in the second, and 7 in the third. 

On only two variables was it possible to 
differentiate among types of leaders. They 
differ in socioeconomic status as indicated by 
paternal occupation classified by the Minne- 
sota Scale of Paternal Occupations (3). The 
differences are significant at the .05 level 
(Table 1). The mean ratings of the three 
groups of leaders are as follows: formal, not 
informal—2.79; formal, informal—2.20; infor- 
mal, not formal—4.28. A difference, signifi- 
cant at the .or level, is found between the 
means of the two groups of informal leaders. 
Those individuals perceived as being leaders 
but who are not formal leaders have fathers 
in the lower occupational classifications. 

Types of leaders differ in their self-esti- 
mates of emotional development (Table 2). 
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TABLE 1 


ANALYsIS OF VARIANCE OF ScorES ON SCALE FOR 
PATERNAL OccUPATIONS OF THREE GROUPS 
or LEADERS 


SuM OF 
SQUARES 


DEGREES OF 
FREEDOM 


SOURCE OF 
VARIATION 


15.55 
51.39 
66.97 


F=4.23:p—.05 


Between groups 
Within groups 
Total 


Those individuals who are both formal and 
informal leaders have a significantly better 
adjustment score (.04 level) than either of 
the other two types of leaders. The mean 
scores are as follows: formal only, 61.82; both 
formal and informal, 55.20; informal only, 
64.00. (The lower the score the better the 
adjustment.) 

All other null hypotheses concerning dif- 
ferences among types of leaders were not 
disproved, even when groups were recom- 
bined in such ways as to afford comparisons 
between all formal leaders and all informal 
leaders. 


Differences Among Leaders, Followers, and 
Isolates 


Certain null hypotheses with regard to 
differences among leaders, followers, and 
isolates and the extent to which they are 
disproved by the present data may now be 
examined. 

Hypothesis: Individuals who play respec- 
tively the roles of leaders, followers, and 
isolates do not differ in the social class with 
which they identify or in the extent of that 
Almost all individuals identi- 
Leaders tend to 


identification. 
fied with the middle class. 


TABLE 2 


ANALYsis OF VARIANCE OF Scores ON SELF-EsTIMATE 
or EmMotionAL DevELOPMENT OF THREE 
Groups oF LEADERS 


MEAN 
SQUARE 


DEGREES OF SUM OF 


FREEDOM 


SOURCE OF 
VARIATION SQUARES 


Between groups 242.99 121.50 
Within groups 1092.07 


Total 
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identify to a greater extent than either fol- 
lowers or isolates with middle-class attitudes 
(Table 3). No differences exist between 


TABLE 3 


ANALYsIS OF VARIANCE OF ScoRES ON EXTENT OF 
IDENTIFICATION WITH Mrppie-CLass ATTITUDES 
oF Leavers, FotLowers, AND IsoLaTEs 


MEAN 
SQUARE 


DEGREES OF SuM oF 


FREEDOM 


SOURCE OF 
VARIATION SQUARES 


126.83 80.63 
1452.50 11.32 


Between groups 2 

Within groups 131 

Total ' 133 1609.33 
F>=7.12:p—.01 





followers and isolates. The mean “middle- 
class” scores on the Social Attitudes Scale (5) 
for the three groups are: leaders, 27.65; fol- 
lowers, 25.66; isolates, 25.05. 

Hypothesis: Leaders, followers, and iso- 
lates do not differ in the extent to which the 
members of their respective families, includ- 
ing themselves, participated as a group in 
family activities prior to entrance into college. 
Individual girls who were leaders indicate a 
greater number and a greater frequency of 
family activities than do either of the re- 
maining two groups, between which there is 
no significant difference (Table 4). The 


TABLE 4 


ANALYSIS OF VARIANCE OF FREQUENCY OF FAMILY 
Activities of Leavers, FoLLowers, AND IsoLaTEs 


MEAN 
SQUARE 


SUM OF 
SQUARES 


DEGREES OF 
FREEDOM 


SOURCE OF 
VARIATION 





619.27 309.64 
95.06 


Between groups 2 

Within groups 113 10,741.31 

Total 115 11,360.58 
F=3.26:p—.05 





average scores for the groups are: leaders, 
13.35; followers, 18.47; isolates, 18.28. (The 
score is the number of types af activities in 
which the individual never engaged.) 
Hypothesis: Leaders, followers, and iso- 
lates do not differ in their estimates of them- 
selves. Leaders are superior, but only with 
respect to self-estimates of proficiency in a 
variety of motor skills (Table 5). Not even 
for this type of self-estimate, however, is there 
any difference between followers and isolates. 





Leavers, FoLLowers, AND IsoLares IN SMALL Groups 


TABLE 5 


ANALYsIs OF VARIANCE OF Scores ON SeLF-EsTIMATE 
oF ProricreNcy 1n Motor SKiLts cF LEADERs, 
FoLLOWERS AND ISOLATES 


MEAN 
SQUARE 


SuM OF 
SQUARES 


DEGREES OF 
FREEDOM 


SOURCE OF 
VARIATION 


428.97 214-49 


55-53 


Between groups 2 
Within groups 113 
Total 115 
F=3.86:p—.05 


6275.18 
6704.15 





The mean scores for leaders, followers, and 
isolates are respectively 44.96, 40.75, and 40.67. 

Hypothesis: There are no differences 
among leaders, followers, and isolates with 
respect to chronological age. While the data 
indicate that leaders are younger than either 
followers or isolates, the fact that the vari- 
ances of the three groups are not homo- 
geneous made it impossible to apply the 
analysis of variance to these data. The mean 
ages are as follows: leaders, 22.3; followers, 
24.2; isolates, 26.5. 

Discussion 

Regarding the three types of leaders, the 
only differences were in the factors of pater- 
nal occupation and self-estimate of emotional 
development. In the former case, the indi- 
vidual who was formally elected to a leader- 
ship position in her group came from the 
higher socioeconomic levels. Those girls who 
were seen as playing informal leadership roles 
only tended to come from the lower socio- 
economic levels. 

With regard to self-estimate of emotional 
development, the individuals who perceived 
themselves as most mature were those who 
were both formal and real leaders. Whether 
this superiority in self-estimate came as a 
result of recognition as a leader by the group, 
or whether such individuals were actually 
recognized as more mature and therefore 
selected as leaders cannot be determined from 
the data. While it is true that individuals 
who indicated less maturity in terms of emo- 
tional adjustment were also considered in- 
formal leaders, they did not seem to be able 
to strengthen their positions through official 
elective recognition by their associates. 

The fact that leaders, without respect to 
type, identified to a significantly greater 


841 


extent with middle-class standards indicates 
that one aspect of the leadership role is a 
strong identification with the social norms 
which are held in some degree by all the 
group members. That leaders claimed a 
greater experience in family activities 
throughout precollege life might indicate that 
such experiences have played a part in the 
development of mature individuals who are 
likely to become leaders. , That leaders had a 
higher regard for their own levels of skill in 
various kinds of motor activities suggests 
that the same pattern of motivation and per- 
sonality which leads individuals early into 
active participation and learning in motor 
areas is also likely to put them into active 
roles in group activities that do not directly 
involve motor skills. 

The paucity of significant differences 
among types of leaders and leaders, followers, 
and isolates found in these groups with 
highly similar formal organizations casts 
serious doubt on the trait explanation of 
leadership for the kind of groups studied. 
This conclusion suggests that a more effective 
attack on the problem of the determinants of 
leadership may be found in the role ap- 
proach to leadership (4), or in viewing 
leadership as a part of the organization of 
the group. 

It is necessary to reconsider certain premises 
in the present study. It was assumed that the 
13 groups were of similar origin, organiza- 
tion, and explicit goal, and that their respec- 
tive memberships did not appear to differ 
significantly in certain variables. It was fur- 
ther assumed that the identically named 
roles within the respective groups were socio- 
logically identical as well, and that the indi- 
viduals who played those roles were also 
similar. Hence, in this special case, it was 
believed that leaders could be compared with 
followers and with isolates as individuals 
without regard for the particular group origin 
or context. Only if these assumptions are 
correct can the data support a theoretical 
position that each role, in large part, must be 
specific to the group. 


SUMMARY 


Based on the frequency with which they 
were chosen on several sociometric criteria 
as well as the fact of official recognition by 
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election to ‘office, the total population of a 
midwestern university . cooperative village 
consisting of 13 residence units was divided 
into three groups: leaders, followers, and 
isolates. A study was then made of the dif- 
ferential characteristics of the individuals in 
these groups with data on some 100 variables. 
Leaders were found to be superior to both 
isolates and followers with respect to the 
degree to which they identified with the 
standards of the middle class (the class with 
which practically all the individuals in the 
village identified), the frequency with which 
the families of the respective girls participated 
as a whole in certain kinds of activities prior 
to entrance into college, and self-estimates of 
proficiency in motor skills. It was concluded 
that a group was more likely to recognize as 
a leader the individual who conforms most 
closely to the standards of the group, who has 
had most experience in the intimate group 
life of the family and who regards herself as 
proficient in motor skills, presumably by rea- 
son of extensive experience and training in 
that area. 

On the basis of whether they were formal 
leaders—that is, officially elected to office by 
their respective groups—or whether they 
were informal leaders, individuals perceived 
by at least a third of the group membership 
as being leaders, the group of leaders was 
further divided into three subgroups, consist- 
ing of formal but not informal leaders, both 
formal and informal leaders, and informal 
but not formal leaders. The characteristics 
of these individuals were also compared. 
Only two significant differences were found. 
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Those individuals who were informal but not 
formal leaders came from a lower socioeco- 
nomic background than did either of the 
other two types of leaders, suggesting that an 
individual to be recognized both as an infor- 
mal and formal leader must have at least a 
minimum socioeconomic status. Formal and 
informal leaders gave a significantly higher 
estimate of their own emotional development 
than did the other types of leaders; it was 
concluded therefore that an informal leader 
to be recognized also as a formal leader must 
have the self-confidence that comes with 
seeing oneself as emotionally mature, relative 
to others. 

The paucity of differences found tends 
to negate the trait approach to leadership 
and suggests the utilization of other frames 
of reference in the study of leadership 
phenomena. 
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ANY present-day personality psycholo- 

gists and psychometricians are heir 

to a trait or faculty theory of human 
behavior. The fond hope of some of them 
is the discovery of a set of traits or faculties 
which are capable of predicting the behavior 
of individuals. They have carried the search 
for stable traits into the realm of constitu- 
tional types, biologically rooted motives, neu- 
rotic complexes, primary mental abilities. 
These psychologists are faced with the dis- 
turbing knowledge, gained from common 
sense and empirical evidence, that the situ- 
ation in which the individual is placed often 
determines his behavior, at least in part. 
Anthropologists, for instance, have demon- 
strated that the social behavior of an individ- 
ual is to a large extent determined by the 
culture and the social class to which he 
belongs. As a result, there is increased aware- 
ness among personality psychologists and 
psychometricians of the need to take into 
account the situation in which individuals 
operate. 

This trend can be seen in the development 
of situational tests by the personnel of the 
Office of Strategic Services during World 
War II to assess candidates for the Service. 
These situational tests were distinguished by 
the fact that a person’s qualifications were 
tested in a number of different situations. 
Previously, the general practice was to devise 
a test—often a paper-and-pencil test—which 
was conceived to measure a trait applicable to 
any situation. In the defense of the use of 
situational tests, the OSS Assessment Staff (4, 
p. 35) claimed that their use led to a more 
accurate study of a man’s varied components 
of personality: 

Underlying our recommendation for the inclusion 
of many varied procedures is the well-accepted fact 


that in order to formulate a personality one must 
know many of its components, and therefore, since 


1 The writer wishes to express appreciation to Rine- 
hart and Company for permission to use data from 
Assessment of Men (4) for this analysis and to Dr. 
Evelyn Raskin and Dr. C. F. Wrigley for helpful criti- 
cism of the manuscript. 


in a single event only a relatively few components 
are exhibited, the more events of which we have 
accurate reports the better. Also, since a man reacts 
differently to different situations, we need reports 
of a wide variety of events in which the subject has 
participated. As a rule, the more varied the situ- 
ations, the more varied will be the components of 
personality which are evoked. The conclusion is 
obvious: to arrive at a conception of the different 
systems and different resources of a man’s person- 
ality, one must discover his emotional responses to, 
and effectiveness in dealing with, different kinds of 
situations. 


The magnitude of the task undertaken by 
the OSS staff in assessing candidates in three 
days was staggering. In addition to interview 
material, biographical data, and casual obser- 
vations, it gathered ratings on ten major traits 
in an average of six different situations per 
trait. This made a total of some 60 ratings 
to be considered in assessing the candidates’ 
qualifications. The trait names and the num- 
ber of situations in which they were rated 
were as follows: 

1. Energy and Initiative, 7 

. Effective Intelligence, 12 

3. Emotional Stability, 4 

. Social Reiations, 8 

. Leadership, 7 

. Physical Ability, 4 

. Security, 4 

3. Observing and Reporting, 7 

g. Propaganda Skills, 5 

10. Motivation for Assignment, 2 
Candidates were assigned to subgroups of 
five to seven members and were rated by a 
team of three staff members. Ratings of the 
staff members on a six-point scale were 
pooled by a discussion procedure to arrive at 
a unanimous final over-all rating. 

Examination of the tables of correlations 
published in Assessment of Men (4, pp. 516- 
524) indicates that the ratings on the so- 
called traits for an individual varied to some 
extent from situation to situation. A team of 
three staff members rated, for example, 
Candidates A, B, C, D, and E for Leadership 
in seven different situations. Now, if we 
define a trait as the rating assigned by staff 
members on a six-point scale a strict trait 
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theory would predict that each individual 
would maintain the same relative standing 
trait even when the situation is 
changed. Alteration in test situation might 
raise or lower his rating somewhat, but pre- 
sumably these effects would be constant for 
all individuals. Therefore, according to a 
strict trait theory the correlation of trait rat- 
ings between any two situations should be 
perfect, except for errors of measurement and 
specific factors peculiar to each situation. If 
the correlations are corrected for uniqueness 
(i.e., for errors of measurement and specific 
factors), however, the predicted correlation 
would be a perfect one. On the other hand, 
if the corrected correlations are less than 
perfect it means that change in test situ- 
ation affects an individual's relative standing 
within the group on the trait in question. 
Factor analysis of a table of correlations 
among trait ratings in different situations will 
indicate the extent to which correlations cor- 
rected for uniqueness are perfect. If they 
are, a single factor will account for all of the 
correlations. If they are not perfect, more 
than one factor will emerge. The number 
of factors indicates the number of different 
dimensions into which the situations spread 
out. The clusters of situations which are 
most widely separated ‘ndicate the different 
kinds of situations which affect a trait rating. 
In order to establish the number and kinds 
of different situations which affected ratings 
of traits the present writer undertook simpli- 
fied factor analyses of tables of correlations 
of four traits which were rated in sufficent 
numbers of different situations. These traits 
were Effective Intelligence, Social Relations, 
Energy and Initiative, and Leadership. 


on the 


Metuop oF Factrorinc AND Drawinc GRAPH 
» procedure for factoring and drawing graphs 

\odification of R. C. Tryon’s cluster and 

nal analysis (7). A fairly complete set of 
reproduced for one variable, Social Rela- 
since this gives a better idea of the method 
than presenting a few tables for all four waits 
(Table 1). Factor loadings and communalities, 
however, for the other three variables are given in 
Table 2. Calculations were originally carried out to 
aces, but were rounded off to two 


es 1S 


tions, 


three decimal pl 
places to conserve space. 

The factoring was done by inserting the highest r 
in the diagonal cells and by first finding the most 
independent clusters. This was done by matching 
the profiles of the highest one-half of the correla 
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tions, beginning with the most independent variable 
(defined as the variable with the lowest =r/A). 
A second estimate of the communalities, shown in 
Table 1-B, was obtained from the squared loadings 
of a variable on the centroid vector of its own 
cluster. Mean correlations were calculated between 
clusters and between clusters and variables, includ- 
ing all of the variables as Cluster T (Table 1-C). 
The extraction of factors was accomplished by 
passing the first axis through the centroid of all of 
the variables (Cluster T). This procedure is the 
same as the first step of Thurstone’s complete 
centroid method, and was adopted because it was 
ideal for obtaining a spherical graph. The second 
centroid axis was passed through the cluster most 
independent of the first (Cluster A, with the great- 
est residual communality) and orthogonal to it. 
The third centroid axis was passed through the 
cluster with the greatest residual communality 
(Cluster B) and orthogonal to the first two axes 
(see Table 1-C and 1-D). Extraction of factors 
was stopped when the communalities of clusters 
were accounted for. This procedure is essentially 
the diagonal method of factoring applied to the 
correlation matrix of clusters. It is similar to the 
method of factoring to which Thurstone (6) refers 
as the multiple-group method. 

The original factor axes were not rotated because 
it was deemed unnecessary for present purposes. 
Interpretation was based on the spherical graph and 
on the correlation among clusters. For Social Rela- 
tions, the calculation of r’s from factor loadings 
showed 28 out of 36 with a discrepancy with orig- 
inal r’s of .o2 or less, and only one with a dis- 
crepancy greater than .05. 

To draw the spherical graph, factor loadings were 
normalized by dividing by the square root of the 
communality of the variable (Table 1-F). This 
insured that each vector reached the surface of the 
sphere. When the normalized factor loadings of 
two axes, other than the first centroid axis, are 
plotted, an excellent view of the relationship among 
the variables is obtained (see Fig. 1). Except for 
some distortion due to the curvature of the sphere, 
distances between points represent the angular 
separation of vectors or the correlation between 
variables corrected - for uniquenes For Social 
Relations, correlations corrected for uniqueness are 
shown in Table 1-A. To simplify the graph the 
final over-all rating, 13, was not plotted and clus- 
ters were not identified. The corners of the 
spherical right triangle were obtained from the fol- 
lowing Landahl’s coordinates: 

I III 
A -577 .8165 
B 577: — .4082 
Cc 5773 — .4082 
The arcs were drawn in by placing one point of 
the compass with a radius of 1.5576 on the cir- 
cumference of the circle. Since the positions of 
the centroid axis and the first cluster are fixed, the 
remaining clusters tend to fall into a simple struc- 
ture following the form of the spherical right 
triangle. 





Factor Anatysis oF OSS Siruationat Tests 


RESULTS 


The results of the factor analyses can best 
be presented in graphical form (see Fig. 1). 
Each situation is represented as a point on 
the surface of a sphere in order to portray at 
lease three dimensions. The arcs drawn into 
the graph form a spherical right triangle and 
represent points of zero correlation with a 
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crosspoint opposite it. The arc BC, for 
instance, represents points of zero correlation 
with crosspoint A. The distance between 
any two points indicates degrees of correla- 
tion corrected for uniqueness between ratings 
of a trait in different situations, perfect cor- 
relation being represented by exact coinci- 
dence of two points. 


TABLE | 


Data For SoctaLt RELATIONS 








A. CorRELATION Matrix Groupep sy CLUsTERs ® 


CLUSTER 





VARIABLE 





Negative Rating 

Positive Rating 

Final Over-all Rating 

Construction .16 
Assigned Leadership .16 
Brook -24 
Debate .58 
Discussion 36 
Interview -54 


86 
.98 -95 
58 -67 
.39 -gI 
+94 -95 





B. Estimates or CoMMUNALITY, SQUARED Loapincs on CLusTER CENTROID 


-45 .58 





.67 -29 -49 -49 





C. Mean Correcartions (r) 


BerTween CLUSTERS AND VARIABLES 





Cc 
of +39 
-40 35 
35 +47 

41 





.0o 


re—re—. 5501 — —.25a2 





Unroratep Factor Loapincs (a) 


-45 .61 


56 


-gl 
.05 





. Normavizep Unroratrep Factor Loapines (a’) 


-79 
4 .s8 
-17 .18 


/ 


A=V a as as 


* Above diagonal: original correlations, rsx. 


06 —.44 


—.02 -25 -17 


Below diagonal: 


.00 . 86 -90 


6 —.39 


, 
a’s*+a'.?+a’',?=1.00 





correlations corrected for uniqueness, rjx/hy Ar. 
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Let us first examine the interrelationships 
among the ratings of the trait of Effective 
Intelligence in 12 different situations. From 
Fig. 1-A it can readily be seen that the cor- 
relations between some of the situations are 
low. Around corner A of the triangle we find 
a group of three closely related situations: 
Brook, Assigned Leader, and Assigned Sub- 
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general kinds of situations rather than in a 
single general situation. 

Let us now examine the nature of the three 
different kinds of situations, taking up first 
the cluster of situations near A. In the Brook 
situation, groups of men were required to 
carry out joint tasks, such as moving a log 
over a brook without dropping it into the 


ordinate. These three have low correlations stream. In the Assigned Leader situation a 


TABLE 2 


Unrortatep Factor Loapincs * 











A. EFFrective INTELLIGENCE 
8 I 





48 83 
-O!I -00 
-50 —.14 
48 -71 





B. ENERGY AND INITIATIVE 


7 9 








.56 .69 

—-44 -45 -44 —-37 
—.03 —.06 .05 -14 
31 -51 -50 - 63 








C. LEADERSHIP 


9 


16 





I e ‘ -68 
Il ° a —.12 
h? of , .48 


65 
-17 
-45 








* Variables: 
1. Interview 6. Discussion 

. Vocabulary 7. Debate 
. Nonverbal 


. Ous 9. Brook 


11. Assigned Subordinate 
12. Judgment of Others 


8. Mechanical Comprehension 13. Final Over-all Rating 


14. Construction 


. SIX—2 10. Assigned Leader 15. Obstacle 


with all other situations, except Interview, 
which is close to the center. Around B. we 
find another cluster of situations consisting 
of Mechanical Comprehension, Nonverbal, 
Otis, and SIX-2. Around C we find still 
another cluster consisting of Debate, Discus- 
sion, Judgment of Others, and Vocabulary, 
to which Interview is most closely associated. 
It is clear that at least three dimensions are 
necessary in portraying the relationships 
among the ratings of Effective Intelligence in 
12 different situations. In other words, in 
speaking of Effective Intelligence it is neces- 
sary to consider it in the context of three 


16. Rating by Associates 


candidate was assigned as a leader to direct 


a group in such tasks as crossing a mined 
road to escape from enemies. In the As- 
signed Subordinate situation individuals as- 
signed roles as subordinates to the leader 
were rated on Effective Intelligence. Com- 
pared to other situations these situations have 
in common working with other people in an 
active situation. These can therefore be 
called “social-active” situations. 

In the cluster of situations near B, Non- 
verbal was a paper-and-pencil test situation, 
involving a candidate’s taking a nonverbal 
test of intelligence. Otis was similarly an in- 
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telligence test of a verbal nature, the Otis 
Self-Administering Test of Mental Ability. 
SIX-2 was also a paper-and-pencil test which 
required the reading and analyzing of four 
military documents and the writing of a 
fifty-word dispatch based on them. Less cor- 


a oyscus. 


DEBATE 


A, EFFECTIVE INTELLIGENCE 


C. ENERGY AND INITIATIVE 


Fic. 1. 
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They can be termed “individual test” situa- 
tions. 

Near the remaining corner C, we find the 
Debate situation. This was a debate between 
opposing teams of men on such controversial 
issues as “What shall we do with Germany 


O@eRooK 
Y 
4 
@consr. 


@ass 


‘ 05.0 
LEADER, RATING 


+ RATINGS 
@INTERV. 


&, @oiscus. 


or) 


ty @oesare 


8. SOCIAL RELATIONS 


D. LEADERSHIP 


SPHERICAL GRAPHS OF INTERRELATIONSHIPS OF SITUATIONAL TESTS FOR 


Four Traits 


Situations: Assigned Leader, Assigned Subordinate, Brook, Construction, 
Debate, Discussion, Interview, Judgment of Others, Mechanical Comprehen- 
sion, Negative Rating by Associates, Nonverbal, Obstacle, Otis, Positive Rating 
by Associates, Rating by Associates, SIX-2, Vocabulary. 


related, but still in the same general area is 
the Mechanical Comprehension situation. 
This also consisted of a paper-and-pencil test, 
the Bennett Mechanical Comprehension Test, 
which tested the ability to understand physi- 
cal principles in everyday situations. It is 
significant that these situations are all paper- 
and-pencil tests of intelligence, which are de- 
void of social interaction and physical activity. 


after the war?” The Discussion situation 
was similar to the Debate situation, with the 
exception that problems such as “What are 
the major postwar problems facing the United 
States?” were discussed by a group of candi- 
dates sitting around a table. Judgment of 
Others and Vocabulary were highly corre- 
lated with the “individual test” situations. 
The Judgment of Others situation was the 
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staff rating of personality sketches written by 
candidates. This was an attempt to appraise 
candidates’ psychological insight into their 
fellow candidates. The Vocabulary situation 
was a multiple-choice vocabulary test, a paper- 
and-pencil test. With the exception of the 
Vocabulary situation, which could be better 
conceived as an individual test situation, these 
situations represented situations involving 
both verbal ability and social interaction. 
Most typical of this type of situation were 
the Debate and Discussion situations. The 
Judgment of Others situation involved writ- 
ten verbalization of a social situation. These 
situations can therefore be termed “social- 
verbal” situations. 

The Interview situation was correlated to 
some extent with all three types of situations, 
with the highest correlation with the “social- 
verbal” situation. The Interview situation 
consisted of ratings based on a lengthy clini- 
cal interview covering the past history of the 
individual, his interpretations of these events, 
as well as behavior in the interview situation. 
In addition, the interviewer had at his dis- 
posal other documents, such as the Personal 
History Record, the Health Inventories, the 
Projective Questionnaire, the Sentence Com- 


pletions, and the (preferred) Work Condi- 
tions Survey of each candidate. 

The OSS Assessment staff conceived Effec- 
tive Intelligence as a broad trait covering at 
least the following abilities: 


1. Collection of data 
2. Diagnosis of the situation 
3. Conception of plan 


4. Execution of action. 


It is clear from our analysis that when 
broadly conceived in this manner intelligence 
must be considered with respect to at least 
three different kinds of situations: the “social- 
active,” the “individual test,” and the “social- 
verbal” situations. A single over-all rating of 
Effective Intelligence would be somewhat 
misleading inasmuch as it would not reflect 
very accurately a person’s standing in the 
three general types of situations. On the 
other hand, if the scores on three general 
types of situations were calculated, they 
would account for the common-factor vari- 
ance measured in 12 situations. Such scores 
representing rating in each of the three dif- 
ferent types of situations could be easily cal- 
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culated by averaging the ratings of the cluster 
of variables most representative of the type 
of situation. 

The factor analysis of the ratings on Social 
Relations in eight different situations reveals 
a similar result, as shown in Fig. 1-B. Social 
Relations referred to the ability of the candi- 
date to get along with others. Here, as in the 
case of Effective Intelligence, we find a dis- 
tinction between the “active” and “verbal” 
situations. The “active” situations were repre- 
sented by the Brook, Construction, and As- 
signed Leader situations. The Construction 
situation was an outdoor situation, involving 
a éandidate’s working with a pair of OSS 
staff members who continually obstructed his 
task of building a wooden structure. The 
“verbal” situations were represented by the 
Debate, Discussion, and Interview situations. 
A third type of situation included Positive 
and Negative Ratings by Associates. In these 
situations the candidates were rated by fellow 
candidates as to desirability or undesirability 
as a co-member in a group. Why these rat- 
ings correlate relatively low with the ratings 
of staff members on the same candidates on 
the same traits (but in other situations) is not 
clear. The ratings of staff members were 
made in more or less formal situations of an 
“active” or “verbal” nature. It is possible 
that the candidates rated their fellow candi- 
dates with the more informal friendship situ- 
ation in mind. It should be noted that the 
correlation among different types of situa- 
tions is not as low as for Effective Intelli- 
gence. Nonetheless, for the most parsi- 
monious description of individuals the trait 
of Social Relations would need to be con- 
sidered in the context of three different types 
of situations. 

The factor analysis of the trait of Energy 
and Initiative in Fig. 1-C reveals two familar 
types of situations. On the one hand, we find 
the Debate and Discussion situations repre- 
senting “verbal” situations. On the other, 
the Obstacle and Brook situations represent 
“active” situations. The Obstacle situation 
was an obstacle course which involved swing- 
ing, climbing, jumping, crawling and which 
was designed to test physical agility and dar- 
ing. The Assigned Leader and Interview 
situations fell between the “verbal” and “ac- 
tive” situations. The beginning of a third 
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type of situation is represented by the Con- 
struction situation, but is poorly defined for 
lack of other situations in this area. The 
trait of Energy and Initiative therefore should 
be considered in the context of two different 
types of situations, which have low correlation 
with one another. 

Only two dimensions were necessary to 
plot the relationships among the seven situa- 
tions in which Leadership was rated, as 
shown in Fig. 1-D. Of the four traits, Lead- 
ership came closest to being a unitary trait. 
Even here, however, it is possible to distin- 
guish the “verbal” situations—Debate and 
Discussion—and the “active” situations— 
Construction, Assigned Leader, and Brook. 
The ratings in the Interview situation cor- 
respond more closely to the “active” than to 
the “verbal” situation, and the Rating by 
Associates situation is more closely related to 
the “verbal” situations. 

From the above four factor analyses it is 
evident that at least from two to three differ- 
ent general kinds of situations must be con- 
sidered in the interpretation of the ratings of 
candidates on the four traits. 

In Fig. 2 is shown the result of the factor 
analysis of the over-all ratings on ten traits, 
since it is pertinent to our discussion. Data 
for this graph were obtained from a factor 
analysis reported by the OSS staff in Assess- 
ment of Men (4, Table 56, Centroid Matrix, 
p. 515). Factor loadings on Centroid Axes 
II and III were normalized and plotted 
against each other. A fourth dimension had 
been extracted, but it was so poorly defined 
and accounted for so little of the common- 
factor variance that it has been abandoned 
for the purpose of discussion here. Near A 
we find the cluster of traits, Observing and 
Reporting, Propaganda Skills, ard Effective 
Intelligence. All of these represent intellect- 
ual ability and can be called intelligence. 
Near B we find the traits of Security, Social 
Relations, and Emotional Stability. Security 
represented ability to keep secrets, caution, 
discretion, ability to bluff and to mislead. 
Security and Emotional Stability correlated 
highly with Social Relations, and they can be 
interpreted to measure in common social ad- 
justment. Near C we find Physical Ability 
and Energy and Initiative, which can be in- 
terpreted to measure in common physical 
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energy. Leadership is closely associated with 
physical energy and also with intelligence. 
Motivation for Assignment is associated with 
both physical energy and social adjustment. 
Three general traits—intelligence, social ad- 
justment, and physical energy—can be em- 
ployed to account for most of the common- 
factor variance measured by ten traits. 


Discussion 


One of the conclusions to be drawn from 
this series of factor analyses is that in assess- 
ing the qualifications of individuals it is nec- 
essary to consider the kinds of situations as 
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well as kinds of traits. In other words, a 
trait is not meaningful unless it is considered 
in the context of a kind of situation. 
Roughly, the OSS study indicates that the’ 
ratings of different traits in different situa- 
tions calls for the consideration of at least 
two general kinds of situations: the “active” 
situation of the Brook and Assigned Leader 
sort and the “verbal” situation of the Debate 
and Discussion sort. In addition, it is neces- 
sary to consider some situations for some 
traits but not for others. For Effective Intel- 
ligence, for instance, in addition to the “ac- 
tive” and “verbal” situations, it is necessary 
to consider the “individual test” situations. 
For Social Relations, the Rating by Associ- 
ates situation must also be considered. The 
number and kinds of general situations which 
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can be discovered by factor analysis, of course, 
are limited by the number and kinds of situa- 
tions in which a trait is originally rated. The 
distinction between “verbal” and “active” sit- 
uations, however, is a promising one. Prob- 
ably most psychologists would consider this 
to be a reasonable distinction from a com- 
mon-sense point of view. In addition, its 
general applicability is demonstrated by the 
effect on the ratings of all four traits which 
were analyzed. The study of situational tests 
should be continued, using a greater variety 
of situations as well as a greater variety of 
traits to check on the validity of the notion of 
general types of situations. 

It is possible to maintain a strict trait theory 
by arguing that a different trait is being 
measured when the kind of situation is 
changed. Effective Intelligence, for example, 
can be conceived to include within it three 
kinds of intelligences: individual-test intelli- 
gence, social-verbal intelligence, and social- 


active intelligence. ‘This, in fact, is a com- 


mon way of phrasing variation in behavior 
from situation to situation. To maintain such 
a position, however, it is necessary to con- 
ceive of the simple rating scale as measuring 


as many different traits as there are kinds of 
situations or to regard the rating scale as 
fallible and incapable of objective measure- 
ment of a trait. Since human judgments 
are involved in making ratings, this possibil- 
ity must be kept in mind. On the other 
hand, since the same team of three judges 
was used to judge groups of subjects simul- 
taneously one would expect a “halo” effect. 
This would increase the correlations among 
ratings in different situations and work 
against our finding low correlations and dif- 
ferent kinds of situations. But, of course, we 
cannot be sure of this. Further experimenta- 
tion with the rating scale is desirable on this 
point. 

Whether one should accept situational fac- 
tors or should maintain a strict trait theory 
in view of knowledge of variation in ratings 
on a trait from situation to situation is a 
theoretical problem. The strict trait theory 
is, in one respect, very similar to the strict 
stimulus-response theory of behavior. Both 
reject the need for the consideration of an- 
other source of variation. The main dilemma 


with which both positions are faced is that 
they cannot be reconciled. The trait theorist 
seeks his dynamics within the organism; the 
stimulus-response theorist outside of the or- 
ganism. There is ample evidence that both 
the traits of an organism and the nature of 
the situation are sources of variation in be- 
havior. To the present writer, who leans 
toward a field-theoretical point of view, it ap- 
pears necessary to accept both the organism 
and the situation, rather than to reject one 
or the other. Consequently, his conclusion 
on the failure of a single factor to account 
for the correlations of ratings on a trait in 
different situations is that kinds of situations 
must be taken into account in considering 
traits. 

The factor analysis of the ten traits pro- 
vides another basis for conclusion. Since the 
traits represent pooled over-all ratings based 
on the observation of candidates in a number 
of different situations, the ratings more or 
less ignore the difference we have found to 
exist from situation to situation. Further- 
more, there is considerable overlap in meas- 
urement among the ten traits, and they can 
be reduced to a set of three more general 
traits—intelligence, physical energy, and so- 
cial adjustment. The net result is a set of 
traits which are not complicated by situa- 
tional factors and are indeed general in na- 
ture. 

How meaningful are these general traits as 
parameters or basic concepts? Intelligence 
suggests the notion of abilities and powers, a 
concept which has been widely employed by 
psychometricians. It provides the basis for 
the principle that the individual who succeeds 
is the one with the prerequisite abilities and 
powers. Physical Energy is most closely re- 
flected by traits of Energy and Initiative, 
Physical Ability, Leadership, and Motivation 
for Assignment. These traits are related to 
the concepts of interest and motivation. 
These are concepts which have been widely 
used by personality psychologists. These 
concepts: provide the basis for the statement 
that ability alone is not sufficient for success 
in school or in industry; interest and motiva- 
tion must also be taken into account. Social 
adjustment introduces a third parameter 
which is the major concern of social psychol- 
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ogists. In a social situation ability and mo- 
tivation are not sufficient to account for an 
individual's success. It is necessary that his 
behavior be coordinated ‘with the behavior of 
others in the group. With some reinterpre- 
tation these “factors”—intelligence, physical 
energy, and social adjustment—appear to be 
meaningful and useful. They might well 
serve as a first approximation to a set of 
general concepts about human organisms. 

A third implication which follows from 
the identification of kinds of situations and 
general traits is the necessity for relating the 
two. It may appear that in asserting the ex- 
istence of kinds of situations which affect 
the rating of a particular trait the present 
writer is refuting the validity of using traits 
to describe individuals. This, however, is 
not the only possible interpretation of the 
data. While the fallacy of a strict trait theory 
of behavior needs to be emphasized, at the 
same time the usefulness of general traits 
should not be overlooked. To obtain a more 
complete theory of behavior the two should 
be related to each other. One way of relat- 
ing general traits such as ability, motivation, 
and social adjustments to kinds of situations 
is to define the traits within the context of 
a kind of situation in question. 

If we are interested in a particular individ- 
ual, we can ask: In what kinds of situations 
does he possess ability? What kinds of atti- 
tudes and interests does he demonstrate to- 
ward different kinds of situations? What is 
the extent of his social adjustment in different 
kinds of social situations? This leads to a 
study of an individual's personality in terms 
of the varying roles he adopts in different 
kinds of situations. If we are interested in a 
particular situation, we can begin to ask ques- 
tions formerly directed in the context of a 
single general situation in the context now 
of different kinds of situations which people 
meet. Instead of seeking “primary” abilities, 
we can investigate the abilities required in a 
“verbal” or “active” situation. Instead of 
seeking a list of basic motives, we can investi- 
gate the interests, attitudes, and initiative in- 
dividuals demonstrate in a particular kind of 
situation. Instead of seeking a general index 
of social adjustment for an individual, we can 
investigate the extent of individuals’ adjust- 
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ment in a particular social situation. This 
leads naturally to the study of social inter- 
action. In every instance the general trait of 
an individual would be defined in the context 
of a kind of situation. 

If such an approach, which takes into ac- 
count both the organism and the situation, 
appears unduly complex, it has the advantage 
of being realistic. The situations need not be 
artificial situations, as the OSS staff has dem- 
onstrated. They can be made to reflect com- 
plex life situations, as in the Brook situation. 
Similarly, the abilities can be complex abili- 
ties, such as the ability to persuade others to 
adopt one’s own point of view, as in the De- 
bate situation. For the purpose of prediction 
it is probably more efficient to know a per- 
son’s complex abilities in a “social-verbal” or 
“social-active” situation than to know his 
scores in more “primary” abilities such as 
memory, reasoning, word facility. The 
fear thac situations are too numerous to be 
brought into a theoretical framework is not 
justified since they can be grouped into gen- 
eral kinds of situations, as the factor analyses 
of situations have revealed. 

In this connection it should be pointed out 
that in the factor analysis of mental abilities 
the failure to distinguish abilities from situa- 
tions has produced confusing primary mental 
abilities. Many of the primary mental abili- 
ties appear suspiciously more like situational 
factors than abilities. One might conceive of 
memory and reasoning as abilities. But what 
about spatial relations, number, and word 
fluency? These refer more to objects in the 
test situation—blocks and geometric figures, 
numbers, words (1, pp. 313-315). By bring- 
ing in the concept of situation it may be pos 
sible to clear up this confusion between abili- 
ties and test situations. One possibility is to 
neutralize situational factors by using an 
over-all rating based on ratings in different 
situations. 

The factor analyses of the OSS data do not 
repudiate entirely the concepts of general sit- 
uations or general traits. On the other hand, 
they do deny the adequacy of a strict trait 
theory which ignores the nature of the situa- 
tion if the simple rating scale is valid. They 
also indicate the kinds of general traits which 
might be useful as basic concepts in an analy- 
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sis which takes into account different kinds 
of situations. 


SUMMARY 


The factor analyses of four traits rated in 
different situations reveal at least two differ- 
ent general kinds of situations which affect 
the rating of traits. These are the “verbal” 
and the “active” situations. The factor analy- 
sis of the over-all rating of 10 traits suggests 
three general concepts which appear to be 
useful in the analysis of the behavior of in- 
dividuals in a situation. These concepts are 
abilities, interest and motivation, and social 
adjustment. It is concluded that it is desira- 
ble to relate these basic concepts describing 
the organism to general kinds of situations to 
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achieve a more realistic study of human be- 
havior. 
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CRITIQUES AND NOTES 


NEW EVIDENCE FOR THE VALIDITY OF TAYLOR’S MANIFEST 
ANXIETY SCALE 


WAYNE H. HOLTZMAN, ALLEN D. CALVIN, AND M. E. BITTERMAN 
The University of Texas 


| recent years considerable interest has been 
shown in the development of a simple and ob- 
jective measure of anxiety to be used for the selec- 
tion of criterion groups in experimental studies. 
One measure, which has been extensively employed 
for this purpose, was derived by Taylor (8) from 
the Minnesota Multiphasic Personality Inventory. 
Approximately 200 items from the MMPI were 
submitted to five clinical psychologists who were 
asked. to designate those which they judged to be 
indicative of manifest anxiety. After items upon 
which there was less than 80 per cent agreement 
among the judges were eliminated, 65 remained 
which constituted the original Manifest Anxiety 
Scale. The odd-even reliability of the scale (cor- 
rected for length) was found to be .92, (3, p. 96). 
In an extensive normative study, Hedlund, Farber, 
and Bechtoldt (2) administered the Taylor scale to 
1971 college students. These investigators reported 
test-retest reliabilities of .82 after a five-month pe- 
riod and .81 after a longer period (nine to seven- 
teen months). They noted no appreciable sex dif- 
ferences, no practice effects, and no differences in 
scores attributable to variations in filler items. 
Although ease of administration and objectivity 
of scoring, as well as high statistical and temporal 
reliability, are desirable characteristics, the Taylor 
scale has thus far been validated only indirectly. 
On the assumption that anxiety functions as a 
motivational state (drive), Taylor deduced from 
Hull’s theory (8) that anxiety level should be cor- 
related with acquisition rate in “defense” condition- 
ing situations. This expectation was realized in two 
experiments with the conditioned eyelid response in 
which. criterion groups were selected from the ex- 
treme ends of the distribution of Taylor scores (7, 
8). Hilgard, Jones, and Kaplan (3), studying the 
eyelid response, and Bitterman and Holtzman (1), 
working with the galvanic skin response, found no 
significant relation between Taylor scores and sim- 
ple conditioning; in these experiments the subjects 
were much more homogeneous with respect to 
anxiety scores, and the negative findings suggest 
that the discriminability of the Taylor index is 
limited over the middle range. Hilgard, Jones, and 
Kaplan did, however, report a significant negative 
correlation between Taylor scores and conditioned 
discrimination, and similar results were obtained 
by Rosenbaum (6). A study by Lucas (5) on 
verbal learning also suggests the validity of the 
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Taylor scale. The performance of low-scoring sub- 
jects was found to improve in the face of repeated 
“failure” while the performance of high-scoring 
subjects deteriorated. It is the purpose of the pres- 
ent note to provide a more direct demonstration of 
the validity of the Taylor scale on the basis of data 
which have recently become available. 

Independently of Taylor, Winne (9) has con- 
structed a somewhat similar scale by an empirical 
analysis of items from the neurotic triad (Hs, D, 
and Hy) of the MMPI. Using 140 normals and a 
matched group of 140 neurotics in a VA Mental 
Hygiene Clinic, Winne found 33 items which dif- 
ferentiated between the criterion groups at or be- 
yond the .o1 level of significance. The 30 items 
that discriminated most significantly were incor- 
porated in a Scale of Neuroticism, which showed a 
corrected split-half reliability of .86 for the normal 
group and .84 for the neurotic group. Cross valida- 
tion of the scale with a fresh sample of male vet- 
erans, normal and neurotic, resulted in a correla- 
tion of .53 between scale score and class member- 
ship. When neurotics who did not manifest anxiety 
were excluded, the effectiveness of discrimination 
improved markedly, suggesting that the test was 
particuiarly sensitive as a measure of anxiety. 

In view ot the careful empirical validation of 
Winne’s scale, strong support for the validity of 
Taylor’s scale would be provided by a high correla- 
tion between the two independently derived meas- 
ures. Lack of a relationship, although not conclu- 
sive, would cast some doubt upon the validity of 
the Manifest Anxiety Scale. To examine this pos- 
sibility, Taylor and Winne scores were obtained 
from the MMPI protocols of 348 college women at 
The University of Texas. Since the items of the 
Taylor scale are not restricted to the neurotic triad, 
as are those of the Winne scale, a third score was 


1The number of items used in computing Taylor 
scores has, for some reason, varied considerably in dif- 
ferent experiments. The scores in the present study 
were based upon a list of 49 items kindly provided by 
Dr. Taylor (personal communication) in May, 1951. 
In terms of the raw score equivalents of the 20th and 
8oth percentiles, the present distribution is identical with 
that reported by Spence and Taylor (7) for a 50-item 
version. The MMPI numbers of the 49 items here 
employed are: 7, 13, 14, 18, 31, 32, 43, 67, 86, 107, 
125, 142, 158, 163, 186, 190, 191, 217, 230, 238, 241, 
242, 263, 264, 287, 288, 301, 317, 321, 322, 335, 337, 
340, 352, 361, 371, 397, 407, 418, 424, 431, 439, 499, 
506, 523, 528, 530, 549, 555- 
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obtained for each subject by counting only those 
items in the Taylor scale which were taken from 
the Hy, D, and Hs scales of the MMPI.2 Compari- 
son of scores on this new scale, which will be desig- 
nated Tn, with scores on the Winne scale should 
reveal the amount of overlap resulting from mark- 
edly different procedures for selecting items from 
a common pool. 

A plot of Winne 
revealed a slightly curvilinear relationship. 


scores against Taylor scores 
Conse- 
quently, a correlation ratio for each regression was 
computed along with a product-moment coefficient 
of correlation. Since inspection of the scatter-dia- 
gram obtained by plotting Winne scores against Tn 
scores revealed no departure from linearity, only a 
The re- 
sults of this analysis are presented in Table 1. The 


Winne and 


product-moment coefficient was computed. 


correlation between Taylor scores is 


TABLE 1 


MANnirEst ANx- 
or NEUROTICISM 
FoR 348 CoLt_teceE WoMEN 


CorreLaTions BeTweEEN TAayLor’s 


1ETY ScALE AND WINNE’s SCALE 


CoRRELATION WITH WINNE 


SCALE : 
. OsTaINneD CoRRECTED 


Taylor (49 Items) 
Product-moment 74 86 
Correlation ratio (TW) 
Correlation ratio (WT) 


Tn (19 Items) 


Product-moment 


unusually high in spite of widely divergent ap 
proaches to test construction which resulted in 
scales having only seven items in common.? The 
fact that the Jn scale is still more highly cor- 
related with the Winne probably is due in great 
part to the substantial increase in the percentage of 
overlapping items achieved by eliminating those 
items of the Taylor scale which were not drawn 
from the neurotic triad. 

Of interest are the results obtained when these 
are corrected for attenuation due to 
the unreliability of each scale. From the reported 
statistical reliability of the 65-item Taylor scale 
(.92), the reliability of the 49-item scale was esti- 
mated, by applying the Spearman-Brown prophecy 


correlations 


2 The MMPI numbers Gf these 20 items are: 7, 18, 32, 
43, 67, 86, 107, 125, 142, 158, 163, 186, 190, 191, 230, 
238, 241, 242, 263, 258. 


8 The MMPI numbers of these items are: 43, 107, 186, 
190, 191, 238, 242. 
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formula, to be .89. The relationship between Winne 
and Taylor scores rises from .74 to .86. Since the 
estimates of reliability (.89 and .84) must, if any- 
thing, be spuriously high because they have been 
computed by the split-half method, the corrected 
coefficient of .86 is a conservative estimate of the 
“true” relationship between Winne and Taylor 
scores. By the Spearman-Brown prophecy formula 
the reliability of the 20-item Tn scale was esti- 
mated to be .77. The correlation between the Tn 
and Winne scales, corrected for attenuation, is .99, 
suggesting that the “true” relationship is almost 
perfect in spite of the fact that only seven items 
are common to both scales. 

The usual practice in experimental studies is to 
administer the Taylor Manifest Anxiety Scale to a 
large number of subjects and to select criterion 
groups of “anxious” and “nonanxious” individuals 
from the two ends of the distribution of scores. 
The cutting points used have varied from upper and 
lower 20 per cent (7) to upper 9 and lower 12 per 
cent (8). In view of the high relationship between 
Winne and Taylor scores, explicit selection on one 
scale involves a high degree of implicit selection on 
the other. We now have, therefore, direct evidence 


of the validity of criterion groups used in the re- 
cent experiments on the relation of anxiety to learn- 
ing—evidence which until now has been lacking. 
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IDENTIFICATION AND THE ADOLESCENT BOY’S PERCEPTION OF 
HIS FATHER? 


ESTHER LADEN CAVA AND HAROLD L. RAUSH 
University of Michigan 


nTiL recently there has been little experi- 
U mental investigation of the correlates of the 
process of identification. This paper reports a 
study of the relationship between conflict in identifi- 
cation with like-sex parent, as measured by a projec- 
tive technique, and perceived communality of inter- 
ests, activities and traits. The hypothesis under in- 
vestigation may be stated as follows: those individ- 
uals who show greater conflict in areas of person- 
ality related to the identification process will in- 
directly perceive themselves as less similar to their 
like-sex parent than will those who show less con- 
flict in these areas. 

The Blacky Pictures (1, 2) were utilized to give 
an index of disturbance in the area of identification. 
The following dimensions of the test were chosen 
for investigation because of their theoretical rela- 
tionship to the identification process (4): (@) Ocdi- 
pal Intensity, (4) Castration Anxiety, (¢) Identifica- 
tion, and (d) Ego Ideal. A fifth index representing 
an over-all estimate of conflict in these four areas was 
also used. 

For an indirect measure of perceived similarities, 
adolescent male subjects were given the Strong Vo- 
cational Interest Blank and at a later date were re- 
quested to fill out Strong blanks as they guessed 
their fathers would. 

The subjects were 37 twelfth-grade high school 
boys,? the majority from families of lower socio- 
economic status. The students were seen as a sin- 
gle group for a total of three times: at the first ses- 
sion the Strong was administered with the usual 
directions; at the second session three days later they 
completed the blanks as they guessed their fathers 
would answer the items; five days later as part of 
the procedure they were given the group administra- 
tion of the Blacky. 


RESULTS 


The scores for the Blacky dimensions were calcu- 
lated from Blum’s revised scoring directions (3). 
Subjects were accordingly classified as weak or strong 


1 This study is based on the undergraduate honors 
dissertation of Mrs. Cava(5) who is now at Boston 
University. Another part of the research deals with 
identification and need for achievement. The association 
between conflict in identification and low motivation 
toward achievement will be discussed in McClelland, 
D. C., Atkinson, J. W., Clark, R. A., & Lowell, E. L. 
The achievement motive. (In preparation.) 

2 The authors are indebted to N. G. Wiltse, principal 
of Ypsilanti High School, and to the teachers for pro- 
viding subjects and facilities for the study. 
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for each dimension, a Weak score denoting less 
conflict in that dimension than a strong score. Cut- 
ting points for classification were chosen so that the 
number of subjects in each of the two categories 
would be as nearly equal as possible. Finally, the 
sums of the scores for the four dimensions were 
added to give a Total Identification Score. 

The score for perceived similarities was given by 
the number of items on the Strong Vocational In- 
terest Blank on which the subject showed agree- 
ment between his own responses and the responses 
he attributed to his father. Consequently, the 
greater the number of items a subject answered in 
the same way for his father and himself, the higher 
was his score. 

Perceived similarities were compared for strong 
and weak conflict groups on the Blacky for each of 
the four dimensions and for the over-all Total 
Identification Score. As may be seen in Table 1, 
the differences were all in the expected direction. 
That is, boys who showed less conflict in each of 
the dimensions relating to identification on the 
Blacky tended to perceive their fathers as more like 
themselves in interests, activities, and traits than did 
boys who showed greater conflict on the Blacky 
dimensions. The differences were significant, how- 
ever, for only the Castration Anxiety Dimension 
(p=.o1) and for the Total Identification Score 
(p=.03), although differences for Oedipal Intensity 
and Identification Dimensions tended to approach 
significance (p=.06 and .08, respectively). The 
distribution for the Total Identification Score per- 
mitted the calculation of a product-moment coefhi- 
cient with perceived similarity scores. The r of 
—.36 was significant at p<.05, in the predicted 
direction. 

It should be noted that this study did not deal 
with actual similarities between fathers and sons, 
but rather with sons’ indirectly expressed percep- 
tions of their fathers. Nor did the study involve 
degree of identification. The interiorization of pa- 
rental modes and values, and its extent, is a highly 
complex process which need not, in terms of theory, 
appear directly in overtly expressed attitudes. It is 
of interest that conflict in areas related to identifica- 
tion by psychoanalytic theory, and as measured by 
a test based on psychoanalytic theory, bears some 
predictable relation to the way male adolescents 
perceive their fathers on such interests, values, and 
traits as are measured by the Strong Vocational In- 
terest Blank. 
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Esruer Lapen Cava AND Harotp L. Rausu 


TABLE 1 


Amount or Conriict ReveaLep BY THE BLacky Pictures AND NuMBER oF FaTHER-SON AGREEMENTS AS 
Perceived BY SONS ON THE STRONG BLANK 








MEAN 
AGREEMENTS 


DEGREE OF 


Biacky DIMENSION 
CONFLICT 


N 





Strong 
Weak 
Strong 
Weak 
Strong 


194.45 
214.64 
187.89 
215.58 
194.38 
Weak 212.50 
Strong 195.06 
Weak 205.65 
Strong 191.50 
Weak 215.19 


Oedipal Intensity 
Castration Anxiety 
Identification 


go Ide al 


Total Identification 


23 1462.89 
14 
18 1361.50 
19 
21 1481.14 
16 
16 1611.83 
20 
20 1463.04 
16 








* All probabilitic “single-ended” 


J n only. 
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HAWTHORNE ON THE THERAPEUTIC ROLE 


HOWARD BRAND 
The University of Connecticut 


OMETIMES a psychologist is abruptly taken out of 
his concern with methods and theories by the 
a different kind of observer of inter- 
personal relations. Here is an excerpt from The 
Scarlet Letter} by Nathaniel Hawthorne bearing 
upon the role of the physician in relation to his 


sensitivity of 


patient. 


Thus Roger Chillingworth scrutinized his patient care- 
fully, both as he saw him in his ordinary life, keeping 
an accustomed pathway in the range of thoughts familiar 
to him, and as he appeared when thrown amidst other 
moral scenery, the novelty of which might call out 
something new to the surface of his character. He 
deemed it essential, it would seem, to know the man, 
before attempting to do him good. Wherever there is 
a heart and an intellect, the diseases of the physical 
frame are tinged with the peculiarities of these. In 
Arthur Dimmesdale, thought and imagination were so 
active, and sensibility so intense, that the bodily infirmity 
would be likely to have its groundwork there. So 
Roger Chillingworth—the man of skill, the kind and 
friendly physician—strove to go deep into his patient's 
bosom, delving among his principles, prying into his 
recollections, and probing everything with a cautious 
touch, like a treasure-secker in a dark cavern. Few 
secrets can escape an investigator, who has opportunity 
and license to undertake such a quest, and skill to 
follow it up. A man burdened with a secret should 
especially avoid the intimacy of his physician. If the 
latter possess native sagacity, and a nameless something 


The scarlet letter. 
Pocket Books, 1952. 


1 HAWTHORNE, NATHANIEL 


Ed., 185 New York 


(2nd 


more,—let us call it intuition; if he show no intrusive 
egotism, nor disagreeably prominent characteristics of 
his own; if he have the power, which must be born with 
him, to bring his mind into such affinity with his 
patient's, that this last shall unawares have spoken what 
he imagines himself only to have thought; if such reve- 
lations be received without tumult, and acknowledged 
not so often by an uttered sympathy as by silence, an 
inarticulate breath, and here and there a word, to indi- 
cate that ali is understood; if to these qualifications of a 
confidant be joined the advantages afforded by his recog- 
nized character as a physician,—then, at some inevitable 
moment, will the soul of the sufferer be dissolved, and 
flow, forth in a dark, but transparent stream, bringing all 
its mysteries into the daylight . . .” (pp. 127-128). 


This excerpt strikes the writer as a startling com- 
mentary upon the current status of psychotherapy. 
Hawthorne writing over one hundred years ago 
states the problem of the therapeutic relationship 
with a sensitivity and cogency equaled by few psy- 
chologists today. Has our psychological science 
offered any further amplification upon the nature of 
interpersonal relations? Does our skill in methods 
and theory offer greater understanding of the psy- 
chotherapeutic relationship? Do we know very 
much more than what Hawthorne has written? It 
is sobering to observe that our skills and our tech- 
niques are matched by the acumen of a sensitive 
writer on human relations. 


Received March 3, 


1952. 





REVIEWS 


Tue PrepicTion oF PERFORMANCE IN CLINICAL, 
Psycuo.ocy. By E. Lowell Kelly and Donald W. 
Fiske. Ann Arbor: Univer. of Michigan Press, 
1951. Pp. xv+311. $5.00. (Lithoprinted).? 


Some time ago a “pure” experimental psycholo- 
gist of considerable prominence remarked to me 
that the Division of Evaluation and Measurement 
and the Division of Experimental Psychology 
should merge as the Division of Scientific Psy- 
chology. Aside from the smug implication that 
some 89 per cent of APA members are not scien- 
tific, it can be presumed that our “pure” experi- 
mentalist failed to recognize the fact that many 
members of Division 5 are interested in measure- 
ment for the purpose of making practical predic- 
tions. This raises the question as to whether 
prediction programs are sufficiently well grounded 
to be acceptable to the purist who might have 
some tolerance for applied psychology. 

To attempt a general answer to this question 
is beyond the limits of this paper. It is our belief, 
however, that some notion of the current status 
of prediction can be obtained by a brief appraisal 
of what may be regarded as the most ambitious 
prediction project ever launched outside the armed 
services—the VA project on the prediction of per- 
formance in clincial psychology. Perhaps our 
clinical friends will not object to a “case” study 
of this effort. 

This book by Kelly and Fiske gives a concise 
report of the oceans of data and mountains of 
statistical analyses involved in the project. Although 
the two authors “assumed the primary responsibili- 
ties for the project from its inception to the 
writing” of the report, the project represents the 
work of 16 consultants on experimental design 
and planning, plus 89 others who were members 
of assessment staffs, plus 23 others who served on 
the professional ‘and technical staff. Ten doctoral 
dissertations have emerged from the project. The 
manuscript of the report was critically read by 
13 persons. With the foregoing information gleaned 
from the Preface, one proceeds to read the report 
with great expectations. 

During the academic year 1946-47 pilot studies 
led to the formulation of a plan for a full-scale 
assessment program to be carried out during the 
summer of 1947 on 137 trainees, the main group 
of subjects. An abbreviated assessment, i.¢e., with- 
out interviews or situation tests, was made during 
the summer of 1948 on a second group of 143 

1 This review is also the address of the retiring presi- 


dent of Division 5 of the APA, delivered at Washington, 
D. C., September, 1952. [Editor’s note] 


candidates for training in clinical psychology. In 
1949, the 1947 cases were reassessed by advanced 
trainees on the basis of the paper-and-pencil 
materials collected during the summer of 1947. 
The actual N’s for most of the analyses tend to 
be considerably less than 137 and 143. 

Meanwhile some thought and work were devoted 
to the criterion problem, but it appears that seri- 
ous planning thereon did not get under way until 
1949. The final criterion data were collected dur- 
ing the autumn of 1950, hence the criterion 
measures came after three years of training for 
the 1947 assessment group and after two years 
for the 1948 group. 

For the main (1947) group the predictor meas- 
ures included four projective tests: Rorschach, 
Sentence Completion, TAT, and Bender-Gestalt; 
and a total of ror objective test scores on each 
person. These objective scores were derived from 
the following tests: Cooperative General Culture, 
Miller Analogies, Chicago Primary Mental Abilities, 
Allport-Vernon Study of Values, Kuder Preference 
Record, Strong Vocational Interest Blank (includ- 
ing special keys), Guilford-Martin Battery, and 
MMPI (including Gough keys). Still other bases 
for assessment were a sociometric questionnaire, 
a 131-item biographical inventory, a structured auto- 
biography, three OSS-type situation tests, plus 
intensive interviews and informal social contacts 
during a nine-day assessment period. 

Much of the foregoing material was utilized in 
various ways by three assessors who eventually 
arrived at final pooled ratings on 31 personality 
traits (R. B. Cattell type) and on 11 general and 
specific skill variables having to do with the job 
of being a clinical psychologist. There is the 
presumption here that clinicians know what clini- 
cians are doing as clinicians! Actually, the ratings 
on these 11 skill variables, which were defined 
by a kind of job analysis of the profession of cli- 
nical psychologist, were predictive judgments as 
to future performance for which it was hoped to 
have, in due course of time, criterion measures 
of actual performance. 

Since these so-called skill variables, for which 
predictive ratings were made, serve also to define 
the criterion variables, they are listed herewith: 
Academic Performance, Diagnostic Competence, 
Individual Psychotherapy, Group Psychotherapy, 
Research Competence, Administrative Competence, 
Supervisory Competence, Competence in Teaching, 
Professional and Interpersonal Relations, Integrity, 
and Over-all Clinical Competence. 

The principal criterion measures were a com- 
posite of ratings by colleagues, university supervisors 
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and installation supervisors on each of the above 
skill except Administrative and Teach- 
ing Competence. Preference for Hiring was also 
included as a criterion measure, and rating on 
Liking, i.e., likability, was included as a criterion 
and as an assessment variable. Additional criterion 
data were obtained by tests devised to measure 
diagnostic competence and competence in therapy, 
a content examination in general and clinical psy- 
chology, and self-report on research activity. 
The statistical analysis of the data is concerned 
primarily with the reliabilities of the many ratings, 
the interrelationships of the various predictors, the 
criterion meas- 


‘ 
variables, 


relationships among the several 
ures, and finally the validational study in which 
the predictor measures are pitted against the crite- 
rion measures. Over 3700 correlation coefficients 
are reported! 

Obviously, we camnot here do more than sum- 
marize the principal findings. Since ratings are 
a crucial part of the study, it is pertinent to point 
out that the median reliability for the final pooled 
predictive assessment ratings on the skill variables 
is near .g0, which is somewhat inflated since the 
judges’ ratings were not arrived at entirely inde- 
pendently. The median intercorrelation of the 
predictive ratings is .71. Our own computations 
indicate that the ratings on Over-all Suitability, as 
a predictor of General Clinical Competence, cor- 
relate .g3 with the common (that is, first centroid) 
factor in the predictive ratings. 

The median interjudge agreement on the crite- 
rion ratings is .63; presumably composites based on 
ratings by university and installation supervisors 
and by colleagues would have reliabilities in the 
.80’s. The median intercorrelation of the criterion 
ratings is .60; the ratings on General Clinical Com- 
petence correlate highly with the first centroid of 
the criterion ratings—a loading of .gr. 

The results of greatest interest are, of course, 
the success with which the criterion measures 
were predicted by what is probably the most 
comprehensive and costliest scheme ever utilized 
in psychological prediction. The correlations be- 
tween the pooled assessment predictive ratings and 
their corresponding criterion measures (ratings) 
range from .52 for Research Competence down 
to .16 for Diagnostic Competence., The median 
correlation is .30, and the correlation between the 
centroid of the predictive ratings and the centroid 


of the criterion ratings is, by our caléulation, about 


.40. 
When one turns to the objective tests as predic- 
tors one finds that Miller Analogies correlates .47 
with rated Academic and that the 
Strong Interest Blank (Psychologist Key) correlates 
.43 with rated Research Competence. For none 
of the other rated criterion measures do we find 
above .35. 


Performance 


objective tests yielding correlations 
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Apparently the objective personality tests are useless 
as predictors. For the Guilford-Martin Battery, 
1.1 per cent of 182 correlations are significant at 
the 1 per cent level and none of the 130 1r’s for 
the regularly scored MMPI reaches the 1 per cent 
level! Predictions based on the Rorschach and 
TAT fare no better, whereas the Bender-Gestalt 
and Sentence Completion yield some low-order 
significant correlations as do also a few of the 31 
Cattell-type personality traits. 

In general, predictions by way of the elaborate 
ratings are no better or worse than 
predictions by way of certain selected test scores. 
Academic achievement and research can be pre- 
dicted more accurately than those performances 
unique to clinical work. Predictions of the latter 
are not accurate enough to have practical worth. 

With the foregoing brief summary of proce- 
dures and results in mind we may entertain the 
general question: Why did such an extensive and 
expensive study yield so little of positive value? 
More specifically: Are there worth-while negative 
findings? Was the statistical analysis faulty? Was 
there wasted effort? What should have been done 
that wasn’t? What should be done next? 

It will be recalled that when the project was 
initiated in 1946 there was considerable excitement 
about the OSS program, hence it is 
understandable that this study should become an 
assessment project. Perhaps an important nega- 
tive finding is the failure of the intensive interview 
and the OSS type of situation test to increase 
appreciably the accuracy of the predictions. Be 
it noted that the assessors in this study had far 
more information regarding the type of activity 
to be demanded of the assessee than was possessed 
or even guessed at by the OSS assessors. 

Regarding the statistical analysis we have only 
a minor quarrel. When one has over 1400 validity 
coeficients and adopts the 5 per cent level for 
judging statistical significance, one must be wary 
of the possible 70 correlations that will by chance 
reach the adopted level of significance. Stated 
differently, if one picks out the best predictors on 
the basis of the highest correlations, one definitely 
For instance, the best test 


assessment 


assessment 


capitalizes on chance. 
predictors for two criterion measures come from 
the Guilford-Martin Battery but these two validities 
(r’s of .30) happen to constitute the 1.1 per cent 
of the Guilford-Martin correlations reaching the 
1 per cent level of significance. It is also disturbing 
to find repeated emphasis on the statistical signifi- 
cance of r’s as low as .20—a case of statistical 
significance having alienated the coefficient of alien- 
ation from the activity of the cerebral cortex. 
When one turns to the question of possible 
wasted effort, one runs the risk of confusing 
hindsight with foresight, yet all too frequently as 
we read the research plan and before seeing any 
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results we raised many questions. It did not 
require very sagacious foresight to predict that the 
Guilford-Martin Battery, the MMPI, and the 
Rorschach would yield nothing. They didn’t. It 
was also our forethought that the other projective 
techniques would yield littl—the TAT yielded 
nothing, but the Sentence Completion and the 
Bender-Gestalt did yield a little. Why were these 
various personality tests included? Were they 
included because of specific hypotheses or were they 
included as part of a fishing expedition with little 
thought as to effective bait? One searches the 
report in vain for a more definitive rationale than 
the statement “We might have used a few tests 
and procedures which seemed highly promising as 
predictors; instead we decided to include representa- 
each of several approaches 


36). A reveal- 


tive procedures from 
to personality measurement” (p. 
ing statement! 

However, when the reader reaches page 198 
he easily gets the impression that certain tests and 
procedures were included because some one had 
“faith,” “high confidence,” or “high hopes” in 
their predictive value. That Kelly and Fiske, as 
analyzers and reporters, were not borne aloft by 
the faith and hope of their clincial brethren is 
evidenced by their discussion of certain correlations 
between tests and criterion measures. If a rationale 
existed for the use of a given test, and if Kelly 
and Fiske accepted that rationale, it is difficult to 
understand remarks such as “these correlations are 
not easy to interpret,” or a correlation is “of 
interest,” or “two interesting validities,” or “inter- 
esting is the finding,” or this relationship “is 
somewhat unexpected.” Then there is the con- 
spicuous absence of any discussion of the fact that 
a prediction based on the Bender-Gestalt correlated 
.41 with rated Research Competence. Perhaps this 
was just another unexpected r, difficult to rationalize. 

From our biased viewpoint, the next most impor- 
tant waste of effort was the factor analysis of the 
relationships among the 42 assessment ratings, i.¢., 
among the 31 Cattell-type personality traits plus the 
11 clinical skill variables. We are told that “This 
elaborate factor analysis was undertaken primarily 
as a guide for later analyses and interpretation of 
assessment data” (p. 60), but there is nothing in 
the report to indicate that use was made of the 
results of this factor analysis. 

We come now to a part of the study which is 
so obviously crucial that it seems almost trivial to 
mention it. This has to do with the selection of 
criteria and the devising of adequate measures 
thereof. We are told that “from the beginning, 
the project’s staff and the consultants . . . were 
acutely aware of the fact that the criterion problem 
would be much more difficult than the assessment 
problem” (p. 72). Then follows a discussion of 
the difficulties, difficulties which none of us can 
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deny, but difficulties that should have somehow 
been overcome before going on with the assess- 
ment program. To the present reviewer it is 
sheer nonsense to have proceeded with an exten- 
sive testing and assessment prediction program 
without having first devised satisfactory measures 
of that which was to be predicted. 

Let us discuss briefly the belated efforts at estab- 
lishing criterion measures. Evidently it was decided 
early to think of the several aspects of the criterion 
in terms of the skills which we have already listed. 
In addition to the Composite Criterion rating on 
Academic Performance, a content examination was 
constructed as a measure of academic achievement. 
We are not told the correlation between this con- 
tent examination and the criterion ratings on 
Academic Performance. 

A test of diagnostic prediction was devised as 
a measure of Diagnostic Competence, but since it 
correlates exactly zero with rated Diagnostic Com- 
petence we are left wondering which, if either, 
should be considered as a criterion measuré 

Five measures of Therapeutic Competence were 
invented, but these correlate so low with each 
other and with the Composite Criterion rating 
that one is puzzled as to which of these, if any, 
may be safely regarded as an adequate measure. 
Accordingly, we agree with Kelly and Fiske when 
they say, “Since we do not have an acceptable 
measure of therapeutic competence, it is not sur- 


prising that we are not able to predict it!” (p. 182). 
As to Research Competence, it is admitted that 


the composite criterion ratings thereon “reflect 
little more than another evaluation of research 
promise” (p. 120). Similarly, it is admitted that 
ratings on Supervisory Competence are “more 
nearly estimates of ‘aptitude for’ than ‘measures 
of success in’ this professional function” (p. 121). 

In essence, we have as the main criterion meas- 
ures the ratings by colleagues and university and 
installation supervisors on several supposedly dif- 
ferent aspects of success in clinical psychology. 
These ratings are based on performance in train- 
ing, which may or may not correspond to later 
on-the-job performance. Except for the old standby, 
academic achievement, it cannot be claimed that 
any of these measures are adequate as criteria, and 
when one examines the correlations among these 
criterion measures one is forced to the conclusion 
that aside from Academic Performance and Re- 
search Competence, these so-called skills are mainly 
one and the same thing, when rated. Whether that 
thing is general clinical competence or just a fore- 
cast thereof or a five-star-general halo is not 
determinate. 

That both the criterion problem and the assess- 
ment problefn aré*f® somewhat of a jumble is also 
attested to by the fact that of the ten Composite 
Criterion Ratings for which there were specific 
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predictive ratings only two were best predicted by 
ratings especially aimed in their direction. Perhaps 
this needs an illustration for clarity: Diagnostic 
Competence is better predicted by the assessment 
ratings on Individual Psychotherapy than by the 
assessment ratings on Diagnostic Competence. 

After discussing .the possible inadequacies of 
criterion ratings by colleagues and supervisors, the 
report goes on to say, “We could not, however, 
justify the expense of bringing the raters together 
from all parts of the country or of sending out 
staff representatives to train raters in the use of 
this particular rating scale” (p. 88). In short, the 
prediction predicament rests on a never-resolved 
criterion crisis. 

The foregoing critical evaluation is not to be 
interpreted to mean that the project yielded noth- 
ing of value. Far from it—the report should be 
assigned reading for all clinical trainees and for 
all those who attempt selection and prediction pro- 
grams. There are many and interesting by-prod- 
ucts, and the project files contain a mass of data 
which might still prove useful. And this brings 
us to the question, What should be done next? 

First and foremost, the criterion problem should 
be satisfactorily solved. A major task, indeed. 
Then, with adequate criterion measures on the 1947 
group obtained 10 or 15 years hence, the test file 
material should be mined by item analysis. This 
is admittedly a straight empirical approach, and 
for this we make no apology since validation item 
by item is usually more fruitful than attempts at 
validating test scores involving items scored for 
some purpose other than the one at hand. The 


1948 assessment group could be used as the needed 


cross-validation group. It would be of interest 
also to pit the 1947 assessment predictive ratings 
against really adequate criteria. If appreciable cor- 
relations emerged we would have some evidence 
that assessment was worth while; if no appreciable 
correlations resulted we would not have mean- 
ingful negative findings since it would not be fair 
to judge the assessors against criterion measures 
which they did not have in mind when making 
predictions. 

Instead of following the foregoing suggestions 
for a salvaging job, it might be more fruitful to 
start from scratch in the search for predictors. The 
obvious next step, when and if the criterion prob- 
lem is solved, is to build tests specifically designed 
for the job of predicting those aspects of the 
criterion for which adequate measures have been 
devised. This will call for more thought than is 
required to draw a representative sample of tests 
and procedures from the psychologists’ warehouse. 

Our perusal of this report prompts us to raise 
some general questions. First, if psychologists with 
ample funds and the best available personnel at 
their disposal cannot some up with a better solu- 
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tion to the prediction problem than achieved by 
this project, the question may be asked as to 
whether we haven't oversold certain psychological 
services. Second, are psychologists maintaining a 
desirable balance between basic scientific research 
and work in thé applied areas? Third, why have 
our efforts at describing personality led to traits 
and measurements which have so little social 
utility? Fourth, how much longer do we tolerate 
the clinician’s claim that his methods of making 
predictions are superior to the objective regression- 
equation technique? Fifth, how long do we con- 
tinue our attempts at predictions for groups known 
to be homogeneous and then explain our lack of 
success by saying that a given group is, after all, 
too homogeneous to permit predictions? 

Our next question concerns a breaa methodo- 
logical issue. This project was largely a coopera- 
tive effort. We hear much propaganda these days 
about the advantages of large-scale programs of 
research, involving professional personnel with 
diverse training and viewpoints. Could Kelly and 
Fiske give us an evaluation, pointing out the dis- 
advantages as well as the advantages of such large- 
scale research? Reading between the lines in their 
report leads us to believe that such programs are 
not without severe handicaps. 

In summary, the major prediction project of our 
time fell far short of expectations. Its fall involved 
tripping over the criterion problem despite the 
recent experiences of wartime psychologists with 
this old stumbling block. With its feet enmeshed 
n a web of uncritically seiected tests, and its head 
overburdened with clinical intuitions, it never 
attained equilibrium, but landed prostrate in a 
field already strewn with efforts, noble and other- 
wise. Its collapse might have been forestalled by 
the injection of fruitful new ideas, but such ideas 
are indeed hard to come by. 

Quinn McNemar 


Stanford University. 


By N. Cameron and Ann 
The Riverside Press, 


BeHavior PaTHOLocy. 
Magaret. Cambridge: 
1951. Pp. xiii+645. 


This book presents the point of view that patho- 
logical behavior stems from faulty social learning. 
The person’s inability to adopt a variety of social 
roles and his invalid social assumptions lead him 
to distorted or ineffective methods of tension reduc- 
tion. The authors are primarily concerned with 
showing . the ways in which he overplays 
and underplays the various techniques of adjust- 
ing to strain and anxiety. If we trace these reac- 
tions through a lifetime of learning, always in a 
social context, we shall arrive at length at a 
genuine comprehension of the symptoms and 
syndromes which define the patient's pathology” 
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(p. 9). Accordingly, the first part of the book 
deals with tensions and the social learning process. 
Needs, reaction sensitivities, language, self-evalu- 
tion, social roles, and emotional reactions are 
traced in terms of the child’s response to social 
molding. The next portion of the book discusses 
the effects of fixation during the maturing proc- 
ess; two kinds of immaturity are described, one 
stemming from intellectual retardation and the 
other from social deviation. The central chapters 
of the book present major varieties of pathological 
behavior. Diagnostic categories are not described 
as separate entities; as the genesis of each abnormal 
reaction is analyzed, the syndromes in which it 
plays a central part are discussed. Thus depres- 
sions, manic states, and anxiety reactions are sub- 
sumed as examples of pathological reactions to 
anxiety, following a discussion of the place of 
conflict and anxiety in daily life. Similarly repres- 
sion is first presented as a normal reaction and 
then as the source of compulsive and hysterical 
symptoms. Delusions, hallucinations, disorganiza- 
tion, desocialization, and deterioration are treated 
separately in the same way. The final two chapters 
give a brief account of various forms of therapy 
and a theoretical discussion of therapy in relation 
to learning. 

Consistency and maturity of theory underlie the 
explanatory formulations. Although the book is 


not presented as a systematic treatise, it merits 


serious attention as a new attempt to formulate 
the essential characteristics of pathological behavior. 
At the risk of misinterpretation and oversimpli- 
fitation, some of the highlights may be sum- 
marized as follows: Needs are defined as increased 
or protracted activities and tensions consequent 
on disequilibria in the organism. Needs may be 
learned as responses to environmental stimulation; 
they are hierarchically organized, but the rela- 
tive position of each is constantly changing depend- 
ing on the extent to which its tensions are undis- 
charged. Stability of personality structure is appar- 
ently produced by enduring reaction patterns rather 
than by a stable and unique hierarchy of needs. 
Unconsummated behavior-sequences, i.e., those 
which do not give the anticipated tension release, 
lead to anxiety. Such sequences tend to be con- 
summated in any way possible, to relieve the anxiety. 
Once a method of reducing anxiety has been 
learned it will persist, however invalid it may be, 
if the patient knows of no more successful method. 
This results in ever-increasing overlearning of 
the faulty technique. This formulation accounts 
for the persistence of painful and self-defeating 
behavior. Emotional reactions are defined as 
“ reactions whose non-specific visceral con- 
tribution dominates or determines them, or colors 
them distinctively” (p. 124). Emotions as subjec- 
tive states are not given systematic consideration. 
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Because visceral reactions persist past the immediate 
occasion of their arousal, by a_ process of simple 
association irrelevant factors take on the prevailing 
emotional tone; phobias may be elaborated in 
this fashion, for example. Learned associations 
set up anticipations. Special sensitivity develops 
toward need-relevant factors in the environment. 
Special insensitivities also develop to exclude 
irrelevant factors. It is not clear whether these 
“reaction sensitivities” are initiated by chance 
experiences of tension reduction or whether 
relevance is immediately perceived because of some 
pre-existing attitude. In any case, this formula- 
tion provides directionality in behavior. 

In ordinary situations, the person responds to 
incomplete cues on the basis of learned anticipa- 
tions. Skillful anticipation is vital for prompt 
and valid response to changing environmental 
events. (While it is never explicitly defined, 
“valid” seems to mean realistic in terms of the 
factual situation.) Other people’s behavioral pat- 
terns, i.c., their social roles, must be comprehended 
swiftly if adjustment is to be rapid. For example, 
the meaning of an outstreched hand must be per- 
ceived in time to respond appropriately. Many 
social roles must be practiced and learned in fan- 
tasy and in group play. Private interpretations are 
normally shared with others, and the adequately 
socialized person accepts the consensus of opinion, 
hence learns socially valid anticipations. Sheer lack 
of practice in role-taking may create social awkward- 
ness leading to anxiety. If this is relieved by social 
withdrawal, unshared and potentially unvalidated 
social anticipations result. The person may ulti- 
mately construct a “pseudocommunity” based purely 
on his own private reaction patterns in their pro- 
jected form. Delusions and hallucinations are 
explained on this basis. 

As the infant learns to anticipate his mother’s 
pleased or excited response to the sounds he makes, 
he begins to acquire language. In time, values 
become attached to certain words, then to the acts 
they symbolize, and finally to the self as actor, e.g., 
“bad, dirty” becomes “I am bad when I am dirty.” 
The authors seem to feel that it would be almost 
impossible to learn values and self-reactions of 
praise or blame without the aid of verbal symbols. 
They admit the existence of wordless imagery but 
do not consider that it plays a major role in the 
development of the child’s evaluative reactions. 

The small child usually accompanies his play 
with a running verbal account of his own feelings 
and intentions. Soon he notices that his public 
avowals bring overt and often unwelcome responses 
from the environment. Eventually he learns verbal 
self-control; he finds he can talk silently to himself 
and conscious inner speech becomes established. 
He also learns behavioral duplicity, i.e., to say one 
thing while thinking another. Finally he starts 
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to practice behavioral duplicity toward himself. 
The truth is rendered nonaccessible to his 
self-reactions (unconscious), and he fills the void 


own 


by verbalizing acceptable feelings or attitudes to 
This analysis of the development 
one of the 


himself instead. 


of behavioral duplicity seems to be 
strikingly insightful sections of the book. Repres- 
sion is formally defined as “. . . inhibition of one’s 


own tension-provoking reactions with the result 


that they become inaccessible to one’s self-reactions” 


(p. 341). 
perhaps purely verbal, in the distinction between 


There seems to be a lack of clarity, 
incomplete repression as illustrated by hysterical 
fugue and over-repression as indicated by hysterical 
amnesias. 

Biosocial 
than being treated as a certain kind of organiza- 
person. Mature behavior is the 
role-taking 
Fixation is defined as 


maturity is normatively defined rather 


tion within the 
achievement of the expected of the 


person within his culture. 


acquired reaction until it is 
immature for the person’s age, 


People ordinarily called psycho- 


persistence of an 
judged biosocially 
sex, and status. 
pathic deviates, as well as those with intellectual 
thus fixated in 

+ 


Regression is the occurrence of 


retardation, are considered to be 
certain 
less mature behavior in a previously more mature 
person, whether or not this particular immaturity 

This heavy reliance on the 
“cultural norm” seems out of 


respects. 


had occurred before. 
vague abstraction of 
keeping with the careful specificity of the rest of 
the book. Certainly the authors are not so naive 
as to believe that acquiescence to and participation 
in prevailing rol 


maturity; by default, however, one is left with the 


es is the sole requisite for social 


impression that there is only one variety of nor- 
mality for any given social status. 
excellence of this attempt to 
interpret abnormal behavior in a unified way in 
eclectic of other recent 
books, perhaps one should not be disappointed 
that th i 
to a logical conclusion. 
conflict frustration remains 
There is an excellent analysis of 
the usual varieties of conflict but no special con- 
sideration of the techniques for resolving them. 
While examples are given of the overlapping of 
mutually contradictory roles, no special weight is 
given to this sort of conflict. 

We are not told personal integrity is 
maintained despite rapid shifts from role to role. 
Self-consistency, self-direction, planning, and indi- 
viduality are not discussed in relation to personality 
disintegration. In fact, the usual ego or self seems 
completely unnecessary to this theoretical system. 
An unintegrated collection of learned needs, reac- 
tions, anticipations, and special sensitivities takes 
its place. Insatiable needs, often thought to be 


Considering the 


comparison to the chaos 
systematic analysis is not always pushed 
For example, the exact 
relation between and 


unformulated. 


how 
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factors producing individuality, are not considered 
here. Inherent urges toward growth, autonomy, 
self-realization, and the achievement of a mean- 
ingful relationship to the environment are not 
assumed. The single power system postulated is 
the urge for restoration of equilibrium. From 
time to time the need for meaning is brought in, 
apparently as a special form of disequilibrium; 
human beings exhibit an irresistible need 
to account for what goes on around them 
a frightening or catastrophic explanation is more 
acceptable than none. ... The need for explana- 
tion which rationalization satisfies is sufficient justi- 
fication for the practice of rationalizing” (p. 380). 
In the light of this quotation, it is surprising that 
the unstructured situation is not presented as a 
primary source of anxiety. No concept is furnished 
to account for structure and differentiation within 
the total environment, no substitute for the cogni- 
tive or phenomenal field. The destruction of 
normal integration within the perceived environ- 
ment is well described in the schizo- 
phrenic thought, but the 
inclusion and interpenetration are not interpreted 
in gestalt terms. 

Actually, the nature of the system 
appears to be behavioristic. Observable reactions 
rather than the patients’ phenomenal experiences 
are analyzed. The analysis of learning appears 
to be a sophisticated application of conditioning 
theory. This should make the book especially 
welcome ‘«o the large number of departments 
committed to a behavioristic and operational orienta- 
tion. Those who place a stronger emphasis on 
fundamental strivings, and those who prefer a 
field-theortical approach will find it a stimulating 
challenge to their own thinking. 

From a didactic point of view the book has 
many specific assets. Symptoms are not over- 
simplified. The authors have not committed the 
error of presenting the cause. For example, they 
point out that a home poor in social techniques 
may not produce a poorly socialized child if 
teachers and peers provide a corrective influence, 
but that such outside influences cannot operate if 
the child has learned to ignore them. Experiments 
and cases cited as examples are given in enough 
detail to be useful. Footnote references suggest 
outside readings rather than merely serving to 
document the statements in the text. Page refer- 
ences are given to previous and forthcoming con- 
siderations of the topic being presented. This 
helps to keep the total trend of the argument in 
mind. 

The relation of this book to Cameron’s previous 
volume, The Psychology of Behavior Disorders, is 
not quite clear to this reviewer. Undoubtedly the 
earlier book was much more useful as an elemen- 
tary classroom text. The organization was clearer, 


section on 


phenomena of over- 


over-all 
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more topics were covered, syndromes were pre- 
sented one by one, the language was simpler, and 
above all the student was encouraged to try to 
grasp the patient’s point of view. The present 
volume is more objective, in the sense of dealing 
with observable behavior rather than inner experi- 
The language and vocabulary are much 
more difficult. Terms are introduced without 
detailed explanation as though the reader could 
be assumed to have an elementary knowledge of 
the field. The theoretical assumptions are much 
the same, but are expressed more systematically. 
Except for the chapters on social and intellectual 
immaturity, no topics are introduced. The 
end result is a thorough exploration of the bio 
social view as applied to selected major techniques 
Thus the book seems suitable as 
The reader 
superficial 


ences, 


new 


of adjustment. 
a text for an intensive second course. 

should have a general but perhaps 
acquaintance with the usual dynamics of human 
development and the usual diagnostic criteria. Medi 
cal students, social workers, teachers, and intel! 
gent laymen, as well as advanced clinical psycholo 
gists, might benefit from such a detailed grasp of 
the biosocial view. 


i 


Marcaret M. Rices 


The Vineland Training School. 


Tue Curnica, Metuop 1n PsycuHo.ocy. By Rabert 
I, Watson. New York: Harper, 1951. Pp. 
xii+779. $5.00. 


A high level of competence and common sense 


characterizes this book. The author’s exposition 
is remarkably free from serious errors of judgment, 
and a spirit of fair mindedness and freedom from 
dogma pervades the pages. The point of view 
toward the clinical method is one which exalts 
neither the ultra-scientific approach nor the purely 
intuitive “subjective” approach. Designed prima- 
rily as a graduate text for the student of clinical 
psychology, it very adequately fulfills its function 
and has much to offer the clinician. The reviewer, 
however, questions its appropriateness for advanced 
undergraduates (another group for which the 
book was written) except as a reference for occa- 
sional chapter assignments. 

About two-thirds of the book is devoted to a 
discussion of diagnostic appraisal and about one- 
third to psychotherapy. Sixteen tests are chosen 
for extended treatment, covering the age range 
from infancy to adulthood. The tests of intel- 
lectual functioning are the Arthur Point Scale, 
the Cattell Infant Test, the Porteus Maze, the 
Stanford-Binet and the Wechsler-Bellevue. With 
this group is included the Vineland Social Matu- 
rity Scale. Memory tests are the Graham-Kendall 
Memory-for-Designs Test, Digit Span, and Wechsler 
Memory. Tests of conceptual thinking include 
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the Healy II, the Kohs Block Design, the Object 
Sorting Test, the Goldstein-Scheerer Cube Test, 
and the Weigl Color-Form Sorting Test. Tests 
of personality functioning are represented by the 
Rosenzweig P-F Study and the Thematic Apper- 
ception Test. 

Disagreements with an author over what should 
have been included in his book are seldom reward- 
ing; however, since one of Watson’s criteria for 
selection was that the test should be “in common 
use in clinical settings” and since, all other things 
being equal, students should learn about tests 
which are widely used, it becomes appropriate 
to inquire into the choices the author made. 
Fortunately the Committee on Diagnostic Devices 
of the Division of Clinical and Abnormal Psy- 
chology issued a report about a year ago based 
on a survey of tests in actual use by_clinical psy- 
chologists. A comparison of the author’s slections 
with the Committee’s findings shows that only 
nine of his selections are found among the 
twenty-five tests in most common use. Five of 
the tests chosen by the author do not even appear 
in the total list of forty-five. It might be argued 
that some of the less frequently employed tests 
presented meet some of the other criteria of selec- 
tion used by the author, e.g., reliability, validity, 
“normative breadth,” but such tests as the Object 
Sorting, the Weigl Color-Form Sorting, and the 
Goldstein-Scheerer Cube tests can hardly be said 
to be outstanding in these respects. In fact, it 
might be said that there is very little evidence 
at present that any one of the three criteria is 
satishied by them. 

The author’s reason for omitting the Rorschach 
(that the presentation would use up a dispropor- 
tionate amount of space) and his reason for not 
considering vocational interest tests (that they are 
not strictly clinical tools) both seem valid. His 
grounds for the omission of personality question- 
naires, specifically the Minnesota Multiphasic test, 
appear to the reviewer to be less valid. He states 
that such inventories are either screening devices 
and thus not needed in dealing with patients who 
are already identified as patients, or they are 
imperfect substitutes for the interview. He neg- 
lects the use of the MMPI as a supplement to 
the interview and as one of the measures of clini- 
cal status before and after therapy. The fact 
that the MMPI ranked tenth in actual clinical 
use by psychologists as determined by the Commit- 
tee on Diagnostic Devices, a higher rank than 
all but three of the tests discussed by the author, 
would argue that it has at least a moderate degree 
of value in the clinic. 

Watson explains that he included only two pro- 
jective methods since “generally the student of 
clinical psychology is exposed to a course in pro- 
jective techniques as such”—an argument which, 
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of course, is more appropriate in accounting for 
omitting any discussion of. projective devices what- 
soever. Perhaps the selection of all the tests is 
best explained by Watson’s desire to bring together 
a number of instruments which would be appropri- 
ate for the most commonly met diagnostic prob- 
and in so doing he exercised his clini- 
Undoubtedly personal preference 


lems, 
cal judgment. 
entered in. 

It is in the actual exposition and discussion of 
the tests that Watson is at his best. Although he 
may be censured for over-lengthy descriptions of 
each test, in the reviewer's opinion, this is prefera 
ble to the unduly brief treatments so often encoun- 
tered. The author's criticisms, when adverse, are 
almost invariably well reasoned and well phrased. 
Some of the especially well presented sections of 
the book are those dealing with the TAT, the 
uses and limitations of test norms, and test validity. 
Perhaps exception can be taken to his statement 
that “the best criteria of .validity develop from 
the clinician’s day-to-day experience with the instru- 
ment” (p. 114). It is true that this is an impor- 
tant source of validation, but the history of clinical 
opinion has certainly demonstrated its weaknesses. 

Watson presents a careful review of the literature 
on the Wechsler-Bellevue’s diagnostic scatter pat- 
terns and deterioration index, concluding with 
respect to the former, “The results . . . are equivo- 
cal and to some degree disappointing. This dis- 
appointment, however, should be tempered by a 
realization that, after all, such procedures devoted 
to the assignment of diagnostic labels are a rather 
insignificant portion of the labors of the profes- 
207). As to 
says in 


sional clinical psychologist” (p. 
Wechsler’s measure of deterioration, he 
summary, “As yet there has been little positive 
evidence to substantiate the contentions of Wechs- 
ler and his associates regarding the deterioration 
index” (p. 214). With both these conclusions, 
the reviewer is in hearty agreement. 

A few minor criticisms of his section on diag- 
nostic tests are the following. (a) He omits an 
analysis of the broader meaning of Wechsler’s 
age-corrected IQ. (5) He accepts very shaky 
evidence that the Wechsler-Bellevue is more effec- 
tive in diagnosing mental deficiency than the 1916 
Stanford-Binet. (c) He evidently believes that 
making up a definition of intelligence on the part 
of the test constructor somehow enhances the 
value of the test since Wechsler is praised for his 
definition, and Terman and Merrill are criticized 
for not intelligence. (d) He rather 
uncritically representativeness of 
Wechsler’s occupational the . stand- 
ardization population. 

The chapters on diagnostic appraisal, covering 
such topics as the case study, observation, inter- 
views, and procedures in diagnostic testing, are on 


defining 
accepts the 
sampling in 
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the whole very effective. Outstanding is his 
comparison of clinical and scientific methods, and 
he. has some very wise remarks on the role of the 
clinical psychologist when serving as a consultant 
to other professional personnel. One weakness in 
these chapters is that the author attempts to give 
material pertinent to a wide variety of clinical 
organizations and to a wide variety of patients. 
Inevitably many of his suggestions fit one situa- 
tion much better than another, and some would be 
contraindicated under certain circumstances. This 
section also suffers from lack of case illustrations. 

The material concerning the mental status exami- 
nation is weak in that it is too long for a mere 
description of what such an examination is like 
and yet is not sufficiently detailed to serve as a 
guide. The whole approach smacks of the mental 
hospital and would be only. partially appropriate 
in many other settings. In considering the case 
study, too much stress is placed on the gathering 
of information from the patient, e.g., “In all cases 
it is important to elicit as much information as 
possible” (p. 42). No mention is made of the 
feelings of the patient while the implied probing 
is going on. In a similar vein, it is suggested that 
every possible source of information be tapped 
(even including the statements of neighbors) but 
no mention is made of the fact that most patients 
need to give their consent before such sources 
are approached. 

According to the author, “The accounts of psy- 
chotherapeutic systems are intended to be primarily 
descriptive rather than critical.” His chapters are 
in accord with his intentions and yet his critical 
statements, when he makes them, are almost 
invariably well conceived and judicious. His 
treatment of common factors and variations in 
psychotherapy is very well organized and meaty. 
And despite the author’s presumed predilection for 
eclectic therapy, his accounts of the psychoanalytic 
approach and of nondirective therapy are not 
unfair. He makes many thoughtful comments on 
the Rogerian method. The reviewer found the 
chapters on psychotherapy with children, group 
psychotherapy, and indirect methods of treatment 
less rewarding than the others. They were not 
inadequate, surely, but they did not rise much 
above the level of a competent summary of the 
literature. 

As a whole this is a very satisfying and reward- 
ing book. Its faults are few and its virtues are 
manifold. The prospective buyer should be warned, 
however, that the publishers seem to have stumbled 
in their book making. The reviewer's copy, at 
least, has no printed matter on pages 694, 695, 
714, and 715. 

Rosert C. CHALLMAN 

Minneapolis, Minnesota. 
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